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ABSTRACT 


The use of a digital comb filter for the separation of two 
speakers was formulated in previous efforts. An adaptive pitch 
synchronous filter has been developed as an alternative. This 
development stems from the characteristics and structure of the 
speech waveform. In the comb filtering development, there were 
tradeoffs between desired speaker distortion and undesired speaker 
separation. In the development of the adaptive filtering tech- 
niques, the results of these tradeoffs will be examined for 
possible improvements over the comb filtering techniques. 

A series of tests performed on test signal inputs compares 
the performance of both the comb and adaptive filtering systems. 
Included in an appendix are several of the main computer programs 
used in the computer implementation. A series of listening tests 
will be performed with these systems in future work making the 
appendix a necessary part of the thesis. 
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CHAPTER | 
INTRODUCTION 
1.1 Introduction and Goals 


The problem of separating a signal waveform from a noise waveform 
has been the topic of unlimited research for many years. In this 
discussion the problem of interest iS one of separating a speech wave- 
form from a "noise waveform" which may be the speech waveform from a 
competing speaker. Two methods for speech enhancement are developed 


in this Thesis: comb filtering and adaptive filtering. 


The remainder of Chapter | describes the speech waveform and its 
characteristics in a manner that will be helpful for terminology and 
modeling. In Chapter |l a review of the previous systems dealing with 
this specific problem will be covered, and the problem formulation 
introduction for this thesis will be outlined. Chapter II! deals with 
the algorithm that was used in determining the fundamental frequency 
of a speech waveform including some of the background research that was 
developed on this method. In Chapter IV the ег” filtering methods 
are formulated along with a discussion of the rules used and problems 
that should be encountered with the system. Chapter V concentrates on 
the test signal formulation, processing, and results for the different 
filtering systems implemented. This chapter also deals with the 


comparison methods that will be used between the outputs of the 





2122 
different systems. The computer implementation of the systems will 
be discussed in Chapter VI, and Chapter VII describes the results of 
the actual speech waveforms that were processed. Included in this 
chapter are spectrograms of the inputs and outputs for the systems used. 
The overall results and conclusions are covered in Chapters VIII and IX 
respectively. The Appendices contain computer programs, flowcharts, 
and documentation for the purpose of making the continuing work in this 


problem easier for those persons concerned. 


The goals of fhis thesis can be summarized in the following 
manner: First, the work that had previously been done was to be 
duplicated. This involved implementing the computer system described 
by Vaden Shields. | The purpose of this work was to arrive at a 
starting location before any other work was begun. The system was also 
implemented for the purpose of comparison with any systems that were 
generated by future work. Second, a pitch synchronous adaptive filter- 
ing scheme was developed and implemented on the computer. Now with 
these two completely separate systems, their results could be compared 
to give the third goal. Finally, the ¡ideas of comb filtering and 


adaptive filtering were examined in order to find a limit to their 


effectiveness on the speech enhancement problem, 


1.2 The Speech Waveform and its Characteristics 


Before the speech waveform can be processed by any type of sys- 
tem, the structure, characteristics, and properties of the waveform 
must be understood. A thorough understanding of these items along 
with the definition of several terms is necessary before the various 


processing systems are discussed. 





ais 
Speech production takes place in an area of the body referred to 

as the vocal tract. The vocal tract can be best described as an 

acoustical tube of nonuniform cross-sectional area that originates 

at the vocal cords and terminates at the lips. The vocal tract of 

the average male is 17 centimeters long, and is shorter in the average 

female adult. The nasal tract which lies between the velum and nostrils 

is another nonuniform cross-sectional area tube that may be acoustically 


coupled to the vocal tract by the opening at the velum. 


Within the vocal and nasal tracts lie the major components for 
speech production. The articulators, composed of the lips, jaws, tongue, 
and velum vary with the size and shape of the tract. The velum is 
also referred to as the soft palate and controls the acoustical coupling 
between the vocal and nasal tracts. The vocal cords lie at the lower 
end of the vocal tract. The vocal cords consist of a pair of lips 
made of ligament and muscle, and the opening between the vocal cords is 
referred to as the glottis. The words subglottal and supraglottal 
that are used frequently in the literature refer respectively to the 


areas immmediately below and above the glottal opening. 


In general there are three methods of producing speech waveforms. 
First and most important, voiced sounds are produced by exciting the 
vocal tract with quasi-periodic pulses of air that originate in the 
lungs as a steady flow and are chopped into pulses by the vocal cord 
vibrations. Second, the vocal tract is constricted in one area, and 
then by means of forcing the air from the lungs through the constric- 
tion, a turbulent air flow is created. Fricafive sounds are produced 


in this manner. Plosive sounds are created by closing the vocal tract 
3 
momentarily, building up a pressure, and then, releasing it. 
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Usually sounds that use the vocal cord vibrations for excitation 
are referred to as voiced, and those that do not use the vocal cords 
are referred to as unvoiced. The sounds that are produced with the 
use of the nasal tract are named nasal sounds. For these types of 


sounds the nasal tract is coupled to the vocal tract at the velum. 


The terms pitch, pitch period, and fundamental frequency can all 
be defined at the same time. These terms have been defined in many 
ways by different authors. G. Fant defines them in them in the follow- 


ing manner: 


The basic property of a vocal cord sound is its peri- 
odicity expressed by the duration T, of a complete 
voice period or by the inverse value of the voice 
fundamental frequency. 


Jm iS 


Fundamental pitch and fundamental frequency are not 
synonymous, but these terms can be used interchange- 
ably due to the close one-to-one correspondence. In 
more strict terminology pitch is a tonal sensation 
and frequency a property of the sound stimulus. The 
duration of a pitch cycle always varies somewhat 

from one period to the next. Such variations are 
systematic determining the intonation in part, acci- 
dental rather than unintentional, but nevertheless of 
importance for the naturalness of human speech. Only 
speaking machines are capable of producing a perfect- 
ly monotonic pitch. 


The fundamental frequency of the vocal cords is directly proportional 
to their tension, mass, and the subglottal air pressure. Since most 
adult males have vocal cords that are longer and thicker than adult 
females the fundamental frequency in the male voice is lower than in 
the female voice. 

For a normally speaking voice the fundamental frequency can range 


5 
between 60 hertz and 400 hertz. Н. L. Shaffer , through a series of 





age 
experiments, determined that the average fundamental frequency of a 


male speaker is 125 hertz, while the average for a female is 192 hertz. 


Another important term, formant, Is defined as the natural fre- 
quency of the vocal tract that corresponds to a resonance or peak of 
energy atf various frequencies referred to as resonant frequencies. The 
resonant or formant frequencies depend on the shape of the vocal tract 


and the positions of the articulators. 


There have been several models proposed for the speech production 
6 
apparatus. The model proposed by Oppenheim and Schafer gives a good 


insight into the manner in which the speech waveform is produced. 


Б КОБЕ Ор 
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Figure 1-1 
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The discrete-time model in figure 1-1 сап be examined in several 
sections. Тһе time varying digital filter corresponds to the vocal 
tract, and the coefficients of the filter may be changed to correspond 
to the frequency response of the time varying characteristics of the 
vocal tract. The vocal tract is usually slowly varying in normal speech 
and can be considered fixed for short periods of time (on the order of 
10 milliseconds). The filter can be excited by a train of impulses, 
p(n), or by samples from a random number generator, r(n). In voiced 
speech the impulse train, p(n), is used to excite the filter, and the 
Spacing between the samples corresponds to the pitch period. For un- 
voiced speech the sequence r(n) is used as a noise-type source for 
filter excitation. | 


8 
Oppenheim and Schafer describe a factor g(n) that accounts for 


the fact that the actual glottal pulses are not impulses. This addi- 
tional factor can be used with the other two components to describe the 


speech waveform, s(n), for voiced speech. 
Sy(n) = p(n) * vy (n) * gín) Clap) 


where: * denotes convolution 
The above expression can also be expressed in its frequency domain 


representation as: 
s (jo = Ped) vel) Gres) (Т?) 


As described above the impulse train, p(n), has a spacing between 
pulses Tp corresponding to the pitch period. P(eJW) is also a train of 
impulses separated by intervals of fp = 1/Tp. This shows that the 


energy of the speech waveform will lie in narrow bands centered about 





ge 


The fundamental frequency and its harmonics. 


Whenever the speech is unvoiced, the speech waveform can be des- 


cribed as: 
Ss an) ven) > gon) 0122) 


Alternatively as before: 
Su(eJ9) - К(е19) V (ej0) с(ејо) (1.4) 


There is not much that can be said about the properties of r(n) 
and R(eJU) other than it has "noise-like" characteristics. Therefore, 
for unvoiced sounds the spectrum of the signal, sy(n), lacks any type 


of harmonic structure. 


There are certain other aspects of the human speech system that 
should be considered. The ear which is considered to be a part of the 
overall speech system is insensitive to errors in phase of a signal, 
while on the other hand, the ear is extremely sensitive to errors in 
the pitch epochs or pitch periods. Many experiments have shown that 
the pitch of a speech waveform provides its naturalness, and if there 
are any errors in pitch of a processed signal, it becomes immediately 


apparent. 


This discussion has been directed toward the explanation and 
definition of certain aspects of the speech waveform that will be used 
and exploited in the following chapters. It was intended to show how 
the speech waveforms are produced and how these waveforms can be 


modeled. 
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CHAPTER il 


REVIEW OF ThE LITERATURE 


2.1 Shields! System 


Since the foundations of this thesis were taken from another thesis, 
some of the fundamental concepts and ideas used in a thesis by Vaden 
Shields will be covered briefly. Shields in his thesis, "Separation of 
Added Speech Signals By Digital Comb Filtering", suggests that the 
harmonic structure of speech may be the key to the effective removal of 
additive noise. He attacks the problem in the following manner. First, 
he looks into the problem of choosing an optimum unit sample response 
for the filter. Second, he examines the effects of unvoiced speech and 
filter interaction. Finally, he looks into the tracking and detection 
of one signal's fundamental frequency from a combined signal. The first 


Two areas will be covered in this summary. 


The filtering strategy used is simple. Since voiced speech has 
its spectral energy concentrated in narrow harmonic bands, another 
speech signal with the same type of spectrum would have its spectral 
energy concentrated in narrow harmonic bands that would, for the most 
part, not overlap those bands of the first signal. This property 
suggested the use of a comb filter adjusted to allow the first signal 


to pass and the second to be attenuated. 


The comb filter must have several properties when it is used in 





ED 
this application. First, the filter must be time variant. This pro- 
perty evolves from the fact that the pitch period of the speech varies 
with time, and the passbands of the filter must be able to move. 
second, the filter can only act over a limited portion of the speech 
waveform at one time. This is due to the fact that the approximation 
of periodicity of voiced speech only holds for a short time segment. 
Also, the duration of the impulse response of the filter should be 
finite and short compared to changes in the speech waveform. P 

The property above that requires the impulse response of the 
filter be finite implies that the digital filter be nonrecursive or 
FIR. There are several ways to design an FIR filter and the method 
Shields uses is to truncate the infinitely long unit sample response 
with a finite duration data window. If a filter were desired that 
would pass only certain frequencies, the "teeth" or passbands would 
need to be very narrow. One Type of filter for this application would 
be a unit sample train in frequency with the spacing of the samples 
fp = 1/MT p -œ< f<% where M is an integer and T is the 
sampling rafe. The unit sample response of this filter is a unit 
sample train in time with each sample separated by (M - 1) zeros for Ж 
- о < п < о, This filter would be set to pass only harmonics of fp. 

As mentioned above, the infinitely long unit Sams response of 
the filter must be truncated in order to create the FIR filter. This 
is done by multiplying the infinitely long unit sample response of fhe 
filter by the unit sample response of the data window. The multipli- 
cation that the above operation implies in the time domain corresponds 


+o a convolution of the respective transforms in the frequency domain. 
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The effect of this operation is to create narrow harmonically spaced 
passbands and this is referred to as a digital comb filter. 
The shape of the passbands and the stopbands of the comb filter 
13 | 
are a function of The Type of window that is used. Shields explored 


the effects of four common window functions: the rectangular, The 


Hanning, the Hamming, and the Blackman. 


Another way of looking at what is happening in this process is to 
go back to the step where the filter's infinite duration unit sample 
response and the data window were being multiplied together. Looking 


at these results gives the following equations: 


Suppose x(n) is a finite duration sequence with a Fourier Trans- 
form, X (eI) | where: 
a N-1 э: 
A EA m) 
n=0 
Now, suppose that x(n) is multiplied by a function f(n), whose unit 


sample response is of the following form: 


m-1 ¡2Trn 
fnn 1 ) e m ; for - о < n < % (2.2) 
m r=0 


f(n) is a train of unit samples that have (m-1) zeros between each unit 
sample. lf z(n) if formed from the product of x(n) and f(n), then, the 


resulting Fourier Transform of z(n) is calculated as fol lows: 
z(n) = x(n) f(n) (2.3) 


Substituting into equation (2.1) 


N-1 | 
Z(eJW) = ) x(n) f(n) e JP (2.4) 
n=0 





HO 


Substituting equation (2.2) in for f(n) in (2.4) gives: 


N-1 m- 1 ¡2Trn . 
Жоу na) е m E Jon (215) 
n=0 m r=0 


Performing an interchange of the summations in (2.5) gives: 


m N-1 EJUS Zn) 


=| 
) ) х(п) е m (2.6) 
=0 


u 
m r n=0 


Equation (2.6) may also be written in the following form: 


Шәй eU 
Ze = 1 ү хе i; (2.7) 
r=0 


1 
m 
Interpreting (2.7) may be easier if considered in the following form: 


Zeit) = 1 Excel) + Xcel 2D) +, 


1. 
m 
m X(ed W = 2n(m-1)), ] 

m 275) 
From equation (2.8) it can be seen that if X(eJU) is periodic on the 
interval {0,27}, then, Z(eJU) is composed of a sequence of teeth that 


are periodic on (0,27 ), and of the same form as X(eJ9)., 
m 


The difference equation that is implemented by this method can be 


expressed as: 


K 
y(n) = ) ay x(nT + kmT) (2.9) 
к--К 
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It can be seen that the output is merely a weighted sum of the input 
values separated by mT seconds. The coefficients, ay» are determined 
before the processing starts and are fixed by the input parameter K. 
This dependency only on the value of K may be shown in the following 


manner. 


Using a Hamming Window with length, N = LM, where L = 2K * 1, 


and L = the number of coefficients used, the expression for the window is: 


w(n) = 0.54 - 0.46 cos (2mn/N) for O<n<N - 1 


= 0 elsewhere, (2.10) 


Substituting in for the equivalent expression for N: 


w(n) = 0.54 - 0,46 cos (2m(¡M)/LM) (2.110 
where n = iM are the only points that have non-zero values. Now, 
w(iM) = 0.54 - 0.46 cos (2mi/L) | Caz 


It can be seen that the coefficients have no dependency on the value 
of Mor fundamental frequency. This shows how the coefficients may be 
calculated and stored before processing and are always the same after 


the parameter K has been chosen. 


There are several factors or properties of ee that must be 
considered. First, the longer the unit sample response of the window, 
the narrower the passbands of the comb filter will be. This occurs 
when the fundamental frequency decreases. Another consideration is 
the fact that a limit exists for the duration of the window due to 
the changing pitch period. For a fixed window length, the choice of 


window will change the filter characteristics such as; passband width 





zd 
14 
and stopband attenuation. In his thesis, Vaden Shields studied the 


choices for type of window and window length. 


Both of these studies were experimental. The length of the window 
was directly proportional to the parameter K and signals were processed 
with and without an additive noise signal to determine the effects of 


distortion and separation of the original signal. 


The experimental observations agreed with the theoretical results. 
As the value of K was increased (this corresponds to a longer window 
length, more coefficients, and narrower passbands), the separation of 
two speakers improved, but the er signal became more distorted. 
Shorter window lengths yielded less separation and less distortion. 
Through a series of listening experiments the value K = 5 was chosen 
as a good compromise value from the distortion and separation view- 
points. Í 

The search for the optimum wi ndow type started with the immediate 
elimination of the rectangular window. The rectangular window intro- 
duced significantly more distortion than Hamming, Hanning, or Blackman 
windows. The differences between these three windows were slight, and 
after listening tests the Blackman window was chosen as the best 
E C 

The second area of Shields! thesis dealt with the treatment of un- 
voiced speech. His first approach was to turn off the comb filter upon 
the detection of a pitch period greater than 20 milliseconds. This ap- 
proach seemed to work effectively when used on.a signal without any 


noise, but did not effectively separate the desired signal from the noise. 


During unvoiced sections of speech, the noise signal would "pop out" 
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of the processed signal. This effect proved to be distracting to listen- 
ers and caused an impairment to intelligibility. 7 

Another approach for the treatment of the unvoiced speech seg- 
ments was used. Whenever an unvoiced segment was detected, the comb 
filter would continue to use the last valid pitch period. This ap- 
proach achieved better results for noise suppression, but it tended 
To distort the desired signal. Ë Shields suggested that other methods 


or approaches for the treatment of unvoiced sections should be investi- 


gated due to the fact that no one method had proven to be superior. 


Shields stated that based on his observations of the results 
this approach to speaker separation works. The performance of the 
system was measured by informal listening tests. The best system 
performance occurred in the recovery of a female speech segment from 
a male - female combined signal. On the other hand, the poorest per- 
formance occurred in the recovery of a male's speech segment from a 
male - female sum. This can be explained in the following manner. 
This method of comb filtering allows for a variable number of pass- 
bands or "teeth" which depend on the pitch period M. If M is the 
value of the pitch period at a given time, then, the number of passbands 
up to the Nyquist frequency (5 kilohertz) will be M/2. |n a female . 
voice the pitch period is generally shorter due to many factors such 
as; size and shape of the vocal tract and articulators. This shorter 
pitch period would cause fewer passbands to fall within The frequency 
range of the speech spectrum. With fewer passbands present fhere is 
less chance of falsely recovering an unwanted signal. With the roles 
of the signal reversed, the opposite effect would occur explaining The 
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difficulty in extracting the male's voice from the combination. 








p 
During the listening tests several factors were noticed that 
caused problems in this system: "unitelligible unvoiced segments, 
short segments of complete distortion of the signal, and breakthrough 
of the undesired phrase." B One cause of the distortion problem was 
attributed to the rapid changes in the pitch period over a short in- 


terval of time. Another cause of the distortion stemmed from the 


inaccuracies in marking the pitch period which was carried out by hand. 


The problem of incomplete or insufficient attenuation of the un- 
wanted speech segment was most bothersome during periods of silence. 
Shields suggested that if periods of silence could be distinguished 
from unvoiced segments, then unvoiced segments and silent segments 
could be treated separately. Ë 

This approach for comb filtering was also tried with white noise. 
Shields originally thought that if the noise were wideband with most of 
the energy outside the passbands, the signal-to-noise ratio would be 
improved. This method failed to produce the desired result, and it 
changed the wideband noise into noise that was harmonic. This dis- 
torted the output and produced a signal that had a "reedy" sound. ~ 

Using noise that has highly harmonic, this method showed some 
promising results. The suppression and distortion were much better 
when the fundamental frequency of the noise was below the average 
pitch of the speaker. The "pop out" effect could still be observed 
when the passbands of the comb filter moved into the range of the 
frequencies where the noise was present. E 


The remainder of Shields' thesis deals with the automatic extrac- 


tion of pitch period from a combined waveform set by  »pstral analysis. 
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Since another type of pitch detection will be used in this thesis, this 


section of Shields! thesis will not be reviewed. 


The conclusions that Shields reached can be summarized in the 
following manner. First, this method will separate the combined 
speech signals of two speakers. Second, this scheme will reduce the 
effects of harmonic noise. Third, possible areas of Improvement in- 
clude the treatment of unvoiced segments and periods of silence. The 
listening tests should be conducted in a more sophisticated manner in 
order to reveal errors in the system. Fourth, the area of pitch 
detection could be improved. Also, the choice of coefficients may not 
be optimal, and a better frequency response might be obtained using 
another set of coefficients. Finally, Shields suggests The usage of a 


second comb filter that would use The pitch information of the un- 
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wanted speaker to block or reject the unwanted speaker. 
2.2 Parsons! System 
25 
In a recent paper written by Thomas Parsons, the problem of 


the automatic separation of the simultaneous speech of two talkers is 


approached in a different manner. 


Parsons points out that the brain requires binaural data in 
order to separate the combined signal, and in some cases, a signal is 
received over a channel that does not provide this binaural informa- 
tion. Also, Parsons states that in most schemes, the signal enhance- 
ment exploits characteristics of one signal or statistical differences 
between the signals. The properties or statistics of speech are not 
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understood well enough. 
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The method that Parsons uses to separate the two talkers capital- 
izes on the harmonic structure of short segments of speech. In the 
frequency domain the harmonic structure of the speech appears as peaks, 
and the procedure involves removing the peaks of the unwanted speaker, 
and then, taking the inverse transform of the remaining spectrum. This 
is a rather complicated procedure to Implement automatically for 
several reasons. The frequency peaks from the two speakers will over- 
lap In some areas. The overlap problem is compounded by the pitch 
variations that occur in natural speech, and this variation causes a 
frequency modulation (FM) distortion of the peaks. Other problems 
occur from the event of the pitch tracks crossing. This may cause 
errors that lead to jumping between speakers. ú = 

Parsons uses four rules for detecting an overlap in the peaks, 
and if any of these criteria are met, then, the process of separating 
the peaks begins. By using the a priori knowledge of the peak shapes, 
an estimate of the peak may be subtracted from the combined peaks in 


order to resolve them. 


The next item addressed is the determination of the peak shapes. 
The largest problem is the effect of the frequency modulation on the 
peak shape. Parsons states that by assuming the pitch variation is 
linear over a short segment, the peaks at the harmonics can be 
approximated by a linear FM ramp. He states that for slow pitch rates, 
the peak shape is approximately the same as in the case of a constant 
pitch period. The frequency modulation of the pitch period is shown 
as a quadrature component with an amplitude proportional to the pitch 
rate, and the peak has a shape that is the second derivative of the 


in-phase component. This approximation for the peak shape holds for 
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frequencies up to 3 kilohertz. 


The separation of the two speakers is done by synthesizing the 
spectrum from a knowledge of the pitch contour and the amplitude and 
phase of every one of the harmonics. The method of synthesis works 
better than the method of subtracting the unwanted speaker's harmonics 
from the combined spectra. Parsons states that if the subtraction 
methods are used, any errors in parameter estimation will result in 


incomplete cancellation of the undesired speaker's harmonics. 


Parsons states that fhis system is still in its beginning stages, 
and that problems of unvoiced speech still have to be solved. Other 
problems encountered are similar to those mentioned by Shields. These 
problems include pitch detection, tracking one speaker's pitch contour, 
and the areas of speech that have pitch rates much faster than in 


normal speech. E 


Both methods summarized in this chapter represent possible solu- 
tions to the speech enhancement problems, and there are probably 
other existing methods. The purpose of this chapter was to acquaint 
the reader with some of the methods that had been proposed, and nof 


an attempt to cover all possible methods for speech enhancement. 


Problem Formulation Introduction 


The problem at this point was to decide how the methods for speech 
enhancement could be improved. The system proposed by Shields was 


used as a guideline because his work had been the only source of 
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information on the subject before Parsons proposed his procedures. 


The primary concern in this thesis was to explore the methods 
used for speech enhancement in voiced sections. The unstructured un- 
voiced speech segments were studied, but because of its lack of struc- 
ture, the procedures were, for the most part, trial and error. Chapter 
IV formulates the method that was proposed as an alternative to the 
comb filter. The method is developed around the structure and time- 


varying properties of the speech waveform. 
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CHAPTER 111 
ЕЛШІСІН DS EC IION PROCEDURES 


The need for a simple and accurate algoriThm for determining 
pitch period or fundamental frequency can not be over emphasized. 14 
the separation of a speaker and noise is to be done in some small 
amount of time by the methods of comb or adaptive filtering, most of 
The pitch detection algorithms have ка be discarded because of their 
complexity and execution time. The accuracy in determining pitch 
period is also very important. The filtering techniques that are 
employed in this thesis use the pitch period information to set the 
time-varying digital filter. These methods of filtering assume that 
the pitch period information ts exactly known for a given speech wave- 
form, and therefore, errors in pitch detection downgrade system per- 
formance. An alternative approach for pitch detection is presented in 


this chapter. 


3.1 Research Performed by Henke 


50 
In a recent paper, Henke , discusses the features of an accel- 


erometer signal that measures glottal movements may lead to pitch 
period detection for a speech waveform. The waveforms shown in 
figure 3-1 immediately show why an accelerometer waveform is more 
desirable from the standpoint of pitch epoch marking than the 


corresponding speech waveform. 
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ACCELEROMETER WAVEFORM 
SPEECH AND ACCELEROMETER WAVEFORMS 


Figure 3-1 


Henke states that a small accelerometer can be attached on the 
midline in the suprasternal notch (2 or 3 centimeters below the 
glottis) to measure the varying pressure waveform. He suggested that 
the most prominent and stable feature, the "flyback stroke", be used 
In pitch period detection algorithms. The "flyback stroke", shown in 
figure 3-2, is a feature of the pressure waveform that is character- 
ized by a rapid change from outward to inward acceleration. This 
feature occurs immediately following the maximum outward acceleration. 
The maximum outward acceleration occurs at the instant of closure 
of the glottis or shortly thereafter. Therefore, the negative going 
segment of the waveform that crosses the zero line creates a stable 
point for pitch epoch determination. Я After these pitch epochs 


have been marked, it is a simple task to determine the pitch period 


or fundamental frequency from these markings. 








Ee 


EE SURE 


| 


FLYBACK STROKE 


Figure 3-2 


3,2 Methods of Pitch Detection Used 


Henke's method was used in the work carried out in this thesis. 
A program was written to process a pressure waveform that was ob- 
tained from an accelerometer at the same time as the speech signal 
was recorded. This was done in a two channel interleaving mode at a 
sampling rate of 10 kilohertz with the speech signal on one channel 


and the accelerometer signal on the other. 


A program was implemented to find all peaks and zero crossings in 
a segment of the accelerometer waveform. These peaks and zero cross- 
ing locations were stored in two separate arrays, and a threshold was 
used to determine which peaks were the ones that were associated with 


the instance of closure of the vocal cords. The first Threshold 
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method consisted of simply setting a constant threshold, by guessing 
before the processing took place. This threshold was used Throughout 
The program to determine which peaks were the correct ones. The 
constant threshold approach yielded two types of errors that can be 
described with figure 3-3. First, the problem of false detection or 
detecting a peak that was not associated with glottal closure. Second, 
the false rejection problem had the complementary effect, in that, a 
True peak was not large enough to be detected by the constant threshold. 
The constant threshold method yielded about 40 to 50 percent total 
errors of both types, and the constant threshold was found to be 
speaker and gain dependent. The errors incurred by this method 


Suggested a second approach for the threshold determination. 
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Figure 5-5 
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The problems in the first approach resulted from the dynamic range 
of the accelerometer waveform. The second approach consisted of set- 
fing a variable threshold. The variable threshold was set by means of 
an energy measurement of the glottal signal over a short time interval. 


The energy, E, was measured in the following manner: 
N-1 2 
E= J x(n) | (3.1) 
=0 


where N was Taken to be 25.6 milliseconds. 


The range of the values for the glottal peaks was divided into 
five levels, and after a careful examination of a few waveforms and 
energy measurements, these ranges were experimentally cetermined. The 
results of the variable threshold approach were quite encouraging. The 
number of errors made was on the order of 20 percent, and this figure 
could be improved if a more careful statistical study were made to 
determine the optimal threshold settings for a given energy measure- 


ment value. 


The idea of the energy measurement also suggested a concept for 
silence detection. In the thesis done by Shields, he suggests that if 
silence could be detected, then some other approach could be taken to 
improve separation. The silence detector was included in the program 
in order that different schemes of filtering could be tried. A 
silent area was determined when the value of E was equal to zero. 

[The silent areas were marked in the table of pitch period values by 


a minus sign in front of the pitch period value. | 
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"After The peaks were detected, zero crossings in the accelero- 
meter signal Immediately following the peaks were marked. The zero 
crossing was the termination of the "flyback stroke", and as mentioned 
before, is thought to be one of the more prominent and stable features 
In the glottal waveform. These zero crossings were marked by using 
the second channel of the file, since the signal only required one 


channel. 


Since the minimum pitch period is around 2.0 milliseconds, the 
algorithm included a rule that repositioned the pointer in the wave- 
form by 20 samples after a correct peak was detected. This helped to 


speed up the algorithm somewhat. 


There was another problem that is still associated with this 
method of pitch determination. The transitions just before voiced and 
unvoiced sections were not of the same form as during voiced sections 
or unvoiced sections. Even looking at the section of the waveform 
that was considered a transition area did not help in the decision of 
where to put the marks for the pitch period. Each signal had to be 
marked by hand in these areas, and fortunately, there were not many of 


them in a waveform. 


The outputs of the pitch detection included two separate files. 
One file consisted of the pitch marks. This file could be added to a 
speech signal for the filtering of the signal, or it could be added to 
an accelerometer signal to determine whether the pitch detection 
algorithm had made any errors. The second output file consisted of 


pitch period values that were determined from measuring the distances 
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between the marks in the previous file. The file is referred to as 


the pitch table, and it is used in the filtering programs. 


The methods used in this section were intended to make the pitch 
detection as easy as possible. The filtering schemes used in the 
thesis assumed that the pitch information was known exactly. The pri- 
mary function of the automatic pitch marking program was to eliminate 
as many periods as possible that would have to be marked by hand. If 
pitch period detection accuracy had been required in the automatic 


marking program, more improvements would have been made. 
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CHAPTER IV 


ADAPTIVE FILTERING CONCEPTS 


4.1 Development 


The idea for the pitch synchronous adaptive filter was developed 
from the ideas of Siamak Samsam as an alternative to the conventional 


comb filtering techniques. 


The name comb filter probably came about from its frequency 
response characteristics. The frequency response of a comb filter, 


shown in figure 4-1 consists of a series of "teeth" or "fingers". 
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Figure 4-1 
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The implementation used for the comb filter in figure 4-1 was discussed 
In section 2.1. The basic idea behind this implementation was to pad 
the impulse response of a low-pass window function with zeros, using 
The same number of zeros between the coefficients as thé pitch period 
of the signal at that particular time. The spacing between the co- 
efficients of the Impulse response was uniform giving the frequency 


response in figure 4-1. 


In this Type of comb filtering implementation the length of the 
filter was on the order of five to nine times the length of one pitch 
period, and this is equivalent to a range of filter lengths between 
40 and 140 milliseconds. In a normal speaking voice the pitch period 
or fundamental frequency can change rapidly over several periods 
depending on intonation or stress, and if the filter length is long, 


the comb filter may not handle The correct samples by the procedures 


discussed in section 2.1. The phenomenon discussed above can pro- 
bably be more easily explained by figure 4-2. 
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Figure 4-2 
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suppose That the signal in figure 4-2 is a speech waveform with 
piTch periods Ты; where p = 0 to 5. It can be seen that the values of 
the pitch periods vary over the length of the filter. Suppose that the 
number of zeros between coefficients has just been changed to cor- 
respond To The pitch period T_, and all spacings between the samples 
are adjusted so that they correspond to the period Ty. This results 
in a misplacement of the coefficients in the waveform with respect to 
the beginning of each pitch period. In the case o az, its position 
in that particular period is much different from the position of co- 


efficients aç, ai, and a». 


Now, suppose that the pitch period Ig, has a smaller error due to 
The pitch detection program. Since all spacings are uniform, the 


error is propagated and compounded in the subsequent spacings. 


With some insight into what the comb filtering was trying to do, 
the adaptive methods were formulated. Suppose that the coefficients 
are rearranged so that the spacings are no longer uniform, but  in- 
stead, the spacings will be made equal to the pitch period for that 
particular point in the waveform. Figure 4-3 shows the placement of 
the coefficients for the adaptive filtering methods. For this method 
each coefficient is in the same relative posi tion in its particular 


period. 


This method does not really correspond to a comb filter as shown 
by the frequency response of a sample adapiive filter in figure 4-4. 
The most substantial thing that can be said about the structure of The 


frequency response shown in figure 4-4 is that it has no general 
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Figure 4-3 


structure. The structure of figure 4-4 points out the frequency res- 
ponse of the adaptive filter provides little insight into its 


capabilities in the speech enhancement problem. 
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E. output of this system can be expressed in the following 


equation: 
2K 
y(n) = b) a, xn-j) 
[70 
where: a; = тле coefficients or weightings 
ЈЕ = 0 
p 
| =m tm,t...m.i 
and, m; = the pitch period of the jth period from the 
originating point. 
2K + 1 = the number of coefficients used. 


There is another method of expressing the ideas given earlier. 


This description is aided with the use of figure 4-5. 


The method can be thought of in the following manner. First, 

The waveform is broken into segments according to the pitch epoch or 
positions where the pitch marks occur. Then, these segments are lined 
up as shown in figure 4-5. Then, a weighted average is ет point 
by point as the filter moves in the direction indicated. What this 
operation intuitively does is to develop an average impulse response 
based on the previous several periods. This technique works wel! in 
conjunction with the assumption that the impulse response of the vocal 


tract is slowly varying. Another important point to be said for this 
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ADAPTIVE FILTER 


Figure 4-5 


technique is the fact that in the separation problem, the weighted 
samples of the desired speaker will add coherently while the contri- 
butions of the undesired speaker will not add coherently. In the 
comb filtering methods the same statement can not be made as strongly 
when the pitch is changing. Since the averaging process in the comb 
filter looks at different relative positions in each period, the 
reinforcement or coherent addition of the A сое is not as 


pronounced as in the adaptive methods. 


It should be pointed out that both methods are identical, and in 
fact, a comb filter when the pitch period is constant. In other 


words, the adaptive filter approaches a comb filter when the 
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pitch period contour is slowly varying and performs a comb filter opera- 
tion when the pitch period contour is constant. When the pitch con- 
Tour is fluctuating rapidly, the adaptive filter does not resemble a 


comb filter in the least. 

There Is one problem in this method, and this problem with a 
possible solution is discussed in the next section. 
4.2 Overload Problem 


A problem in the adaptive filtering method referred to as the 
"overload problem or phenomenon" can be described with the aid of 


figure 4-6. 





Figure 4-6 
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Suppose that there is a rapid increase in the pitch period in a 
relatively short interval of time. This phenomenon is not rare in 
normal speech. For some speakers a distinguishable feature or 
characteristic is the short pitch period that occurs at the onset of 
voiced speech. This short period can be represented by the segment 
in figure 4-6 labeled as i5. As the filtering moves to the right com- 
puting a point by point average, The method runs into problems at the 
point 12. Up fo the point T5 the filtering has occurred in the cor- 
rect manner. After qT, the coefficient a, begins to use samples in seg- 
ment two for the averaging. This feature disagrees with the concepts 
of the adaptive filtering methods because the coefficients now are not 
in the same relative positions in their respective segments. This 
phenomenon has been named the overload problem and causes an undesirable 
result in the output waveform. The overload phenomemon will be studied 


more in the next chapter. 


There are several proposals for solutions to the problem of over- 
loading. First, the easiest solution from all viewpoints is to ignore 
the problem and let the overloading occur. If the pitch period is 
slowly varying, the problem will be minor and the coefficients wil | 
not be far away from the same relative position in their respective 
periods. Second, the adaptive filter could be "turned off" at T, and 
pass the input signal completely for the fime interval between Т апа 
Tg . This idea originated from the fact that the impulse response of 
the vocal tract would be of sufficiently low amplitude in the tails 


of the impulse response so that the filtering operation would not have 


much effect in this area. The final solution proposal, and the one 
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that was used in this thesis was one that padded the short segments in 
the filter's length with zeros in order to make the length of the 


short segments equal to the one in the front of the filter. 


Figure 4-7 explains this procedure. The shortest period, seg- 


ment 5, will have zeros added in the area indicated. 
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Figure 4-7 


When the filter reaches the end of segment 4, the same problem occurs 
and zeros are again added to pad out this segment. In effect this is 
the same procedure as "turning off" the individual coefficients asso- 


ciated with the segments that have been padded with zeros. In figure 





AN 
4-7 this would involve setting a> equal to zero after T, is reached 


and az equal to zero after Tz Is reached. 


There is one other factor to be considered in this method. The 
gain associated with the filter has been fixed to be a unity gain 
system by: 


2 
a, = 1 (4.2) 


Н 2 =< 


к= 0 


When a particular coefficient is set to zero or "turned off", the 
remaining coefficients must be rescaled so that the output is not 


attenuated. 


The third procedure for handling the overload problem makes 
sense from an intuitive standpoint. It says that if a coefficient 
can not be placed in the same relative position in the segment or 
period due to the shortness of that period, then, the contribution of 
that coefficient will only provide erroneous results, and therefore, 


it should not be considered. 


4.5 Rules 


As the filtering algorithms become more complicated and complex, 
the list of rules for handling various situations also grows. Some 


of the more important rules used will be discussed in this section. 


The discussion thus far has dealt only with the procedures for 
handling voiced speech. Normal speech waveforms contain unvoiced 


sections also, and these sections must also be handled. There are 
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three basic areas or situations involved in the processing of unvoiced 


speech sections. These situations are: 


l. The detection of an unvoiced section, and transition 
from voiced speech procedures to unvoiced speech 
procedures. 

2. The procedures that are used when the filter is entirely 
fn the unvoiced section. 

3. The detection of a voiced section, and transition from 


unvolced speech procedures to voiced speech procedures. 


The first situation mentioned above was handled by using the last 


voiced pitch period as shown in figure 4-8. 


In figure 4-8 the speech waveform has been omitted for clarity, 
but the sections are marked as voiced and unvoiced. Тһе values labeled 
m; are equal to the pitch period of that particular period. In figure 
4-8 (a) the filter position is shown just prior to an unvoiced area 
detection. Figure 4-8 (b) shows the spacing after the first coeffi- 
cient has entered the unvoiced area. In this figure two of the 
spacings have a spacing of т, while the other two spacings are related 
to their particular periods. In figure 4-8 (c) another spacing 
change is shown so that three spacings are equal to my. This pro- 


cedure is repeated unti! the entire filter lies in the unvoiced area. 


Now the second situation has been entered, and another scheme is 
begun. Since the method of attenuating the input in an unvoiced 
section worked satisfactorily in the system proposed by Shields, it 


was used in this system also. In effect, the filtering was terminated, 
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POICED-UNVOICED SPEECH TREATMENT 


Figure 4-8 


and the input waveform was multiplied by a constant less than one in 


order to produce the output waveform. 


In the third situation the procedure was basically the inverse of 
the method used in the first situation. When a voiced section was 
detected, the filter was initiaiized to a constant spacing using the 
first period value. As the filter moved further into the voiced 


sections, the spacings were changed to conform to the pitch periods. 


In summarizing this chapter the most prominent feature of the 
adaptive filtering approach is its conformity with the speech waveform. 


The performance of this system wi!! be evaluated in later chapters. 
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CHARTER SY 
TESTI SIGNAL SECT ION 
5.1 Development of Test Signals 


It was decided that before any new methods for speech enhancement 
were developed, a method of testing the various schemes should first 
be undertaken. Even though psychological listening tests were to be 
conducted in the future, there were just Тоо тапу variables in the system 
to be handled in a complex listening test. In order to assign some 
types of performance indicies, and in order to fix some of the vari- 


ables, a test input signal was generated and stored for future use. 


The purpose of the test input signal was to alleviate the un- 
certainty of characteristics of the speech waveform, such as, the 
pitch period and the impulse response. When the pitch period and the 
impulse response of a voiced section of speech are known, then, some 
types of input and output waveform comparisons can be made in order fo 


determine optimal filter properties. 


A test input signal was formulated in the following manner: 
First, it was decided that enough comparison data could be obtained 
from a relatively short segment of the input signal. This feature 
would provide results that were accurate, and in addition, they could 


be obtained quickly. Second, it was decided that the impulse response 
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of the generation filter would be time-invariant over the duration of 
the signal. This assumption was chosen in order to make the compari- 
son study easier, and it conforms to the properties of speech, in that, 
the impulse response of the vocal cords over short time segments can 

be considered constant. Third, it was decided that the pitch period 


would vary sinusoidally about some mean pitch period. 


The input signal formulation can be described by the following 


diagrams and equations: 


Let x(n) = the test input signal. 
v(n) = the impulse response of the generating filter. 
w(n) = the nonuniformly spaced impulse train corresponding to 
the pitch pulses or the excitation. 
Иепе x(n) = vln) * wn) or Сар 


TEST SIGNAL GENERATION SYSTEM 





Figure 5-1 


The impulse response of the generating filter, v(n), was chosen 


to be a damped sinusoid of the following form: 
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ДЕЕ Е ЕЕ Е-Е RESPONSE 


Figure 5-2 


where: v(n) = eT/%  sin2mn 0 <n < L-1 (5.2) 
The model seems to be a very good model for the impulse response of 
the vocal cords. The parameters a and w were variable and were 

chosen so that the input signal that was generated would be a good 


model of speech. 


The length of the impulse response was chosen to be 2T., where 


О? 
Tg is the value of the mean pitch period. It was felt that In order 
to model speech as closely as possible, the length of the impulse 

response of v(n) should be longer than the pitch periods encountered. 


This would cause some overlap of the impulse response, v(n), in the 


input waveform. 


The nonuniformly spaced impulse train, w(n), was formulated in 
the following manner: The spacing between the impulses, (the pitch 
period) was chosen to vary sinusoidally about some mean pitch period 
Ig . The figure 5-3 shows the procedure for this development. 1+ 


can be seen that this nonuniformly spaced impulse train is periodic 


on the interval [0, N |. 
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Figure 5-3 
where: б. =6sin2 Ti for i = O to N-1 
N (555) 


Now with these two components the input signal x(n) was gen- 
erated. This gave a waveform whose impulse response, and pitch 
contour were exactly known, and the various methods of Speech 


enhancement could be tried on this signal. 


952 Processing of Test Signals 


With the test signals that have been described in the previous 
section, several processing methods — now be more closely examined. 
The filtering schemes used in this section were designed so that they 
resembled the actual filtering systems in all respects except for the 
length of the input waveforms used. Since the test input waveform was 
much shorter than the actual speech waveforms, it could be completely 
stored in core memory, and the amount of time needed for the signal 


processing was short. 


There were three filtering systems implemented for the test signal 


input waveform. These systems included the system proposed by Shields, 
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the adaptive filtering systems proposed in Chapter IV with The ocver- 
load problem, and the adaptive filtering system without the overload 
problem. For the remainder of the thesis the adaptive system with the 
overload problem will be referred to as the adaptive overload system. 
The adaptive system that compensates for this problem will be referred 


to simply as the adaptive system. 


Another problem that received much thought was the problem of com- 
paring these system in some manner to determine the good and bad points 
of both systems. Several tests were decided upon that were related to 
the problems of speech enhancement. In the thesis done by Shields, the 
tradeoffs between desired speaker distortion and undesired speaker 
separation were described. These tradeoffs were also discussed in Chap- 
ter Il. It was decided that two separate tests could be conceived for 


this area. 


The arcs + test will be referred to as the input pass test. Figure 
5-4 describes the manner in which this was implemented. By using the 
enhancement system in this manner, the output waveforms should exactly 
resemble the input waveform, or in other words, the system should act 
as an identity system. By examining the inputs and the outputs of the 
systems, the amount of desired speaker distortion could be determined 


for various numbers of coefficients or windows used. 


The second test, described by figure 5-5, will be referred to as 
the input reject test. In this type of filtering the pitch information 
is provided from another input waveform that is totally uncorrelated 


with the test input signal. The idea behind this type of processing 
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Figure 5-4 


is to examine the amount of attenuation introduced into the output 
waveform. Ideally, the output would be zero, because the enhancement 
system would be passing another waveform with a different pitch con- 
tour. This test was designed in order to measure the amount of 


speaker separation that resulted from the various systems. 
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Another test was devised in order to examine the capability of 
each system when white nolse was added to the input signal. Figure 


2-6 shows the implementation of this system. 
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Figure 5-6 


Shields stated that the system proposed in his work seemed to turn 
white noise into a highly harmonic noise in the output waveform that 
was very annoying. i This test was designed to determine if the 
adaptive filter had a problem with white noise, and if it did, the 


systems could be compared to determine which one did the best in these 


EncumsTtTances. 


Finally, a test could be performed on the sum of two waveforms 
that would duplicate the problems of the overall speech enhancement 
system. IT should be pointed out that this test was merely a 
combination of tests one and two, and since the systems were linear, 


superposition would have given the same results. 
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In all cases it was assumed that the entire test signal was 
periodic with a period of twenty pitch periods, and the processing 


only concerned itself with one period of the overall waveform. 


It should be pointed out that the test signals used in these tests 
were not actual speech waveforms. The input waveform was modeled to 
closely resemble the speech waveform during voiced sections. There 
were no comparison tests made on test signals modeled as unvoiced 
speech. In the next section the results of these tests for the differ- 


ent systems implemented will be discussed. 


2.5 Results of Test Signal Processing 


In order that the results of the test signal processing be pre- 
sented as clearly as possible, many figures will be employed in this 
section. There are three basic types of presentation used in this 
section. First, a time domain presentation is used to show how the 
various systems process the input waveforms, and from these figures, 
the input and output waveforms can be compared visually. Second, a 
frequency domain presentation is used. The test signal was formulated 
to simulate a signal that had been sampled at 10 kilohertz. A short 
time spectrum was computed by resampling the input waveform at 5 kilo- 
hertz, multiplying by a Hamming Window, and then, the spectrum was 
computed using a Fast Fourier Transform (FFT) program. The bandwidth 
of the test signal was well pee the Nyquist Frequency for The new 
M ling rate. The frequency domain presentation displays logarith- 


mically the square of the magnitude of the Fourier Transform of the 
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signals, and this was computed to eliminate the effects of any phase 

errors that were introduced by the filtering. Finally, an error func- 
tion is shown in some cases, and this error function can be expressed 


in the following equation: 


° 2 ° 
Е = 20 log(|XteJ]^ ) - 20 1ogt|Ytes®ı]? } (5.4) 
where: X(eJ9) is the Fourier Transform of the input signal 


Y(eJ9) is the Fourier Transform of the output signal. 


Notice that the error function defined above can be thought of in 


another manner. Consider the linear system shown in figure 5-7 below: 





Figure 5 7 


Clearly, 

(ПІ exon) > nin) E 5570) 
Expressing equation (5.5) in the frequency domain: 

Y(te9)  - x(eJ9) Hed) (5.6) 


Y(eJ®) 
X(eJu) DATA 


Now, H(eJ9) 
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Taking the magnitude and squaring both sides gives: 








Н(е®)|2 = | vteJO3|2. » |vcejoy]2 
X(eJ0) |ХСе1®)|2 (5.8) 


Now, taking the logarithm of both sides gives: 


log|HtejW)|2 = log | [ү(е4®)|2| = 
|X(eJ9) |? 
log( |Y(eJ9)|2) EX ОО |х(е.19) |2) (5.9) 


Notice that this is almost the same form as the error function. H(eJw) 
is often referred to as the system transfer function, and the error 


function may be represented as: 


E = log 1 
|H(el®™)|2 (jO) 


Therefore, The error function could be Thought of as being related To 


the inverse system transfer function. 


Before proceeding the following discussion will be covered in or- 
der to show how the error function will be used to calculate system 
performance. Consider the system proposed in figure 5-7. Suppose that 


the following result was desired: 
y(n) = a x(n) (Sal) 


Expressing this in the frequency domain and because the system is 


linear: 


у (ејб) = a X(eJ9) (5.12) 








xx 


Now, Х(ејо) Sm 
Y(eJU) a СЭЭ) 
log | [XteJ9) [2 = 109 | 1 
|Ү(е.%) |2 EE (5.14) 
E = logí|X(eJ9)]2) - t10ogt|Y(eJO)|2) = tog] 1 | (5.15) 
Es 
E = log (1) - 2 109{|а[} = - 2 togtlal} (5.16) 
The above results will be used in later parts of this section for 


comparative analysis between the systems implemented. 


The first test described in section 5.2 was named the input pass 
test. This test was performed with the three basic systems developed: 
Shields', the adaptive overload, and the adaptive. For all systems 
five coefficients were used, and these coefficients were taken from 


the Hamming Window function. 


Figure 5-8 shows the time domain input and output waveforms for 
Shields! system. Several things can be noticed in the output waveform. 
First, there is a definite amplitude modulation throughout the wave- 
form. Second, there are some areas where an "overload bump" occurs 
in the output. Figure 5-9 shows the same presentation for the adap- 
tive overload system. It can be noticed that the amplitude modulation 
does not exist, but there are still some problems with overloading. 

In figure 5-10, the adaptive system results are shown: input and out- 


put waveforms are essentially identical. 
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A similar comparison can be made by viewing the spectra of the 
input and output waveforms. Figure 5-11 shows the spectrum of the 
input waveform, while figures 5-12, 5-13, and 5-14 show the output 
spectra of Shields' system, the adaptive overload system, and the 
adaptive system respectively. Figure 5-15 presents the error function 


for each of the systems. 


For this test the system should be performing as an identity sys- 
tem. This says that the value for a in equation (5.11) should be 


equal fo unity. Substituting this into equation (5.16): 
E = -2 |104 (1) - 0 (5.17) 


This result is approached in all three systems, but the adaptive system 


comes the closest to matching the ideal error function for this test. 


From all three presentations, the adaptive system clearly performs 


much better than the other two systems on the input pass test. 


The second test performed was named the input reject test. The 
purpose of this test was to determine how well the systems would re- 
ject or attenuate an undesired speaker. This corresponds to the separ- 
ational aspects of speech enhancement. The manner in which this test 
was conducted was simple. A waveform was generated with different 
pitch periods, and the pitch marks from that signal along with its 


pitch table were used to filter the original waveform. 


The second waveform had a pitch period that also varied sinusoid- 


ally about some average value that was identical to the original 
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Figure 5-13 Output Spectrum - Adaptive Overload System 
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Figure 5-14 Output Spectrum - Adaptive System 
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waveform. The ó of the second waveform was changed in order to give 
the different set of pitch periods. It should be pointed out that 
this procedure allows for several of the corresponding periods to have 
the same pitch period. This is synonymous to a crossing of pitch con- 
tours m a pitch period versus time presentation. Ideally, the output 


from the systems should be zero, but this result could not be achieved. 


Figures 5-16, 5-17, and 5-18 show the time domain results of the 
input reject test for Shields', the adaptive overload, and the adaptive 
systems respectively. From these presentations it is extremely diffi- 
cult to establish which system is doing a better job of attenuating 
the input waveform. Each system has certain areas that it does well 


in, and areas that it does not do well in. 


The frequency domain presentation does provide a better picture 
of how the systems performed on this test. The input signal spectrum 
has not been included in this series of figures because it would be 
the same spectrum as the one shown in figure 5-11. The output spectra 
for the Shields', the adaptive overload, and the adaptive system are 
Enesemeed in figures 5-19, 5-20, and 5-21 respectively. In all systems 


the general level of attenuation is on the order of five decibels. 


The system shown in figure 5-7 may again be to measure the 
performance. In equation (5.11), if the value of a is less than one, 
this corresponds to an attenuation of the input signal. Consider 
equation (5.16) again. If the value of a is less than unity, Then, 
the logarithm of a will be negative making The value of E positive. 


As a approaches zero, the value of E approaches infinity. Therefore, 
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Figure 5-19 Output Spectrum - Shields! System 
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Figure 5-20 Output Spectrum - Adaptive Overload System; Input Reject Test 
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in evaluating the system performance for this test, the magnitude of 
the error function is proportional to the amount of attenuation applied. 
The error functions are given in figure 5-22, and these error functions 
show several characteristics. The average level of the three error 
functions is approximately the same. Therefore, it may be stated that 
overall the adaptive systems seem to do no worse than the system 


proposed by Shields on the input reject test. 


The next test concerned the performance of the systems in the 
presence of white noise. The white noise was generated by an analog 
noise source, lowpass filtered at 4.7 kilohertz, and then, sampled af 
10 kilohertz. The gain of the noise was selected so that the signal- 
to-noise ratio would be much greater than one. The noise waveform was 
then added to the signal as described in figure 5-6 to produce the in- 


put waveform for the enhancement systems. 


The conclusions from Shields thesis were that the system did nof 
perform well in the presence of white noise. The observation that the 
comb filter changed the wideband noise into noise that was highly 
harmonic is exemplified by figure 5-25. The output waveform in figure 
5-23 is quite different from the input signal. The same statement may 
be made concerning the output of the adaptive overload system, which 
is shown in figure 5-24. The output of the adaptive system as shown 
in figure 5-25 is quite different from the outputs of the other two 
Systems. The input and output waveforms are almost identicat, and 
this is quite encouraging. The adaptive system passes the signal and 


noise combination almost exactly. In the adaptive overload and in 
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Figure 5-22 (b) 
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cope 
Shields! system, the output waveform is distorted immensely due to 
the white noise. On the other hand, the adaptive system's output is 
noisy, but the distortion of the other two systems is not present. 
Figure 5-26 shows the noiseless input signal and the output of the 
adaptive system with seven coefficients in the filter. It can be seen 
That the effects of increasing the number of coefficients in this case 


is negligible. 


The spectrum of the noise used for this test is shown in figure 
I-27, and the spectrum of the input signal added to the noise is shown 
ИГ иге 5-28. Figure 5-29 is the spectrum of the output from Shields! 
system for the case of the signal with additive white noise. It can 
be seen that the spectrum is composed of distinct harmonic bands that 
were not as prevalent in the other cases. The areas outside the 
harmonic bands have been lost in the filtering process, therefore, 
causing distortion in the output. The output spectrum for the adaptive 
overload system, figure 5-30, resembles the input more closely than 
Shields' system, but there is some energy getting lost as the level of 
the spectrum is Jess than that of the input's spectrum. The adaptive 
system's output spectrum, figure 5-31, has only minor differences from 
the spectrum of the input. Figure 5-32 shows the error functions of 
the outputs in the same manner that they have been Present in the 


past. 


There is little doubt that the adaptive system performs better 
than the other two systems from the standpoint of this particular test. 


The adaptive system provides an output signal that may be noisy, but 





59| 


SOGE 


SSION SLIUM 


'$1u91211190) ueAesg 'uejsÁg eAI.Jdepy - suJoje^eM 4nd4no pue (ssəjəsjou) 4ndu] 9gz-g eJnBij 
Al 


u Al "mW. Wi lan À AD 
y "A | 


AMI aM БӨРІЛІ ohh ІІ! 
IT 


Nay ү ЛГУ 


| “ҮҮЧҮ WR 
ААЙ AL AL. ALA 
Ш A A 


vA 





zone 





4591 9SSION 941 UM 
“s]uUS19¡33809 Lanas “wejsÁs aa jdepy - SWJOJBARBM inding pue (ssojesjou) Anduj (4,102) 9z-a eun51j 


ААА ААЙ кад Li 
PHA “ү 


f pot "Wr AM e y 


| | AN у A | MI Pf {ү € A 





1 


y 54 ^j; ap U vr v 





rog 





FREQUENCY (KI LCHERTZ) 





"GG NI G3YYNOS AGNLINOVW 


Me — — 


Figure 5-27 Spectrum of Additive Noise 
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Figure 5-28 Spectrum of Input Signal plus Noise 
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Figure 5-51 
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Error Function - Shields' System; White Noise Test 


Figure 5-32 (a) 
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the other two methods provide an output signal that is both noisy and 


distorted. 


The final test was conducted for the purposes of continui ty. 
This test, as pointed out before, is a combination of tests one and 
two. The system proposed by Shields is presented in a time domain 
representation in figure 5-33. The adaptive overload and adaptive 
systems are shown in figures 5-34 and 5-35 respectively. The spectra 
of the three systems are shown in figures 5-36, 5-37, and 5-38 for 
shields! system, the adaptive overload, and the adaptive systems 
respectively. Upon careful examination of these spectra, the level of 
the spectrum from Shields! system is generally lower at most frequencies 
than either of the adaptive systems or the spectrum of the desired 
speaker shown in figure 5-11. This point seems to agree with the 
phenomenon observed earlier in the input pass test. The amplitude of 
the output from Shields! system in the input pass test was being ampli- 
tude modulated, therefore, the waveform would have to contain less 
energy at some frequencies than an unmodulated waveform of the same 
type. The error functions shown earlier have been omitted for this 
test because they are almost identical and not very benefical for 


comparisons. 


In summary, this series of tests was performed in order to provide 
some other means of system evaluation. It should be stated that the 
fact the adaptive filter did not suppress the noise waveform better 
than the system proposed by Shields is not surprising. The main ad- 


vantage of the adaptive system is fidelity, not rejection. Some of 
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Figure 5-36 Output Spectrum - Shields' System; Combined Test 
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Figure 5-37 Output Spectrum.- Adaptive Overload System; Combined Test 
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the phenomena and errors observed from these test signals were help- 
ful in implementing the systems that worked with actual speech. When 
the listening tests are performed, a more accurate comparison can pro- 
bably be presented. However, before these tests are performed, other 
than informal listening evaluations and speech spectrograms, these 


results are all that are available for system comparison. 





zu 
CHAPTER VI 


COMPUTER IMPLEMENTAT LON 


6.1 Description of Computer System 


The computer work for this thesis was done on a PDP-11/45 digital 
computer. This computer is a highly sophisticated and powerful 16-bit 
word machine with 52 K of core memory and dual cartridge disk drives 


capable of holding 1.2 million words each. 


Other peripheral equipment included fhe Lab Peripheral System, or 
LPS, a storage oscilloscope, a Hewlett Packard 7004 B plotter, and a 
VTO5 Alphanumeric Display Terminal. The system was also equiped with 


analog lowpass filters, attenuators, and audio amplifiers. 


The LPS is a modular, real-time subsystem that houses a 12-bit 
analog-to-digital converter (A/D), a programmable reai time clock, and 
a display controller which includes two 12-bit digital-to-analog con- 
| verters (D/A). With the LPS the task of input and output to the com- 


puter was easily handled. 


The LPS real time clock was programmed to sample an input signal 
at 10 kilohertz after a lowpass filter with its cutoff frequency at 
4.7 kilohertz. The computer was set to sample two channels simultan- 
eously, and the speech signal along with the glottal accelerometer 


signal were put onto the disk in the form of a data file. The samples 
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were stored in an interleaving format so that the time reference 
between the two signals would be preserved. The program that conduct- 
ed the sampling was set to sample up to 3.2 seconds of speech with its 
corresponding accelerometer signal. This amount of time was chosen so 
that most sentences could be accommodated, but with minor modifications, 
The program could be changed to allow any length file. The program 
That conducted Тће sampling also took advantage of the direct memory 
access (DMA) option of the A/D converter, which allowed the conversions 
to be stored in memory at the maximum rates without processor inter- 


vention. 


The LPS also provided a capable display control. Along with the 
knobs and switches provided, a display program was written that was 
capable of displaying and editing waveforms. This feature was very 
beneficial in correcting pitch markings and in viewing processed and 
unprocessed waveforms. X and Y cursors with knob controls were in- 
cluded with an LED readout of position in order to locate specific 


points in the waveform. 


The routine used for audio output was driven from the D/A convert- 
er in the LPS. The signal from the D/A converter was lowpass filtered 
at 4.7 kilohertz and then amplified. At this point the output signal 


could be recorded on magnetic recording tape for future use. 


6.2 Shields' System 


The filtering methods discussed by Shields were implemented on the 


PDP-11/45 using the standard RT-11 Fortran. It was decided that an 
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implementation in fortran would be a slower version, but after algo- 
rithms were perfected, the execution speed could be improved.  Flow- 
charts for the systems described in this section are included in the 
appendix. Since the A/D and D/A converters used 12-bits, the input 
and output data values were in integer form although the internal 


multiplications were done in floating point format. 


There were two different systems implemented for the comb filter- 
ing techniques, and the difference in these two methods was the manner 
in which the unvoiced segments were handled. A brief statement about 
these two methods seems necessary before the systems are described. 
The first method, the attenuated input method, stopped the comb fil- 
tering when an unvoiced segment was encountered. The program then 
began to attenuate the input by some specified constant until the 
next voiced section appeared. The second method, the "inertial 
filter", ignored the fact that an unvoiced segment had been encount- 
ered and continued to filter the input using the parameters of the 
last known pitch period value. The name inertial filter resulted 
from the physical aspects of the processing. The filter could be 
thought of as moving through the input samples with some velocity, 
and without any external forces, the filter and its motion would 
remain unchanged as it moved through an unvoiced section. With the 
exception of the manner in which unvoiced speech sections were 


handled, the two systems were basically the same. 


The filter coefficients could be derived from several sources. 


A subroutine was written in order to let the source for the 
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coefficients and the number of coefficients be determined as the pro- 
gram commenced execution. The choices of sources included the four 
window functions that were used in the original system proposed by 
Shields. The window functions were the rectangular, the Hanning, the 
Hamming, and the Blackman. A second choice was the coefficients from 
a lowpass filter designed by the Parks-McClel lan Algorithm that was 
available on the computer. The final choice was one that allowed the 


user to type in any coefficients desired. 


After The coefficients had been determined, the input buffer was 
initialized with the first section of data. As mentioned in the pre- 
vious section, the waveform had been sampled using two channels. The 
signal was stored on one channel, and the pitch marks were placed on 
the second channel. This type of implementation could be considered 
wasteful from the standpoint of storage, but from the aspects of the 
display, the waveform along with pitch marks could be viewed very 
easily. As the filtering commenced the center coefficient checked 
each value on the second channel for the pitch marks, and when a mark 
was encountered, the filter was changed. A pitch table file contained 
the distance between pitch marks and was used to furnish a new pitch 
period value after a mark was encountered. If the value of the pitch 
period was greater than 20 milliseconds, the section was treated as 
an unvoiced section. The unvoiced speech procedures described above 


were then implemented depending on the system. 


The pitch table was also marked to denote areas that were silent. 


These silent areas were detected as mentioned in Chapter ||! from an 
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i.c measurement over a segment of data. These silence marks were 
used only to speed up the processing at the beginning and ending of a 
sentence. If a value from the pitch table indicated a silent area, the 
output of the filter was set to zero, and no multiplications were per- 
formed. Procedures for handling the silent areas that appeared inside 
the sentence limits were not thoroughly investigated, and this could 


be a topic for future research. 


From all practical aspects and to the best of the author's know- 
ledge, these systems were identical to the ones described by Shields. 
The execution time for these systems was on the order of seventy-five 


times real time. 


6.3 Notch Filters 


Another method that was investigated for the speech enhancement 
problem was a notch filter implementation. Shields suggested that a 
notch filter be used with the pitch marks of the unwanted speaker to 
reject the unwanted MON 

In the comb filter implementation, the impulse response of a low- 
pass function was used. Using the principle of duality a notch filter 


implementation yields a frequency response for one of the possible 


filters as shown in figure 6-1. 


The high pass prototype filter was designed by means of the Parks- 


McClellan Algorithm. There were several parameters that had to be 


varied in order to decide what type of prototype filter was needed. 
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Figure 6-1 
These parameters were: 


1. Filter order 
2. Stop Band Width 


5. Transition Width 


The filter order of the prototype high pass filter was limited to 
less than twelfth order. This value was determined from the implemen- 
tation. If a number of zeros equal to the pitch period were inserted 
between coefficients, then the effective order of the notch filter 
grows quite rapidly. For example, if an eleventh order prototype 
filter were chosen and The pitch period were 100 hertz, then, The 
order of the notch filter would be 1100th order. A filter length of 


this magnitude is on the borderline of the quasi-periodic assumption 
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made for the speech waveform. For the most part, the high pass proto- 


type filter order was between seventh and ninth order. 


The second consideration was the width of the stop band in the 
prototype filter. This parameter can be linked directly to the 
width of the spectrum of the speech waveform at the various harmonics 
of the fundamental. If the stop band is too narrow, then the energy 
at each harmonic from the unwanted speaker will not be sufficiently 
suppressed. On the other hand, if the stop band is too wide, the notch 
filter may suppress the desired speaker and cause distortion. The 
above discussion relates back to the basic tradeoff for speech enhance- 
ment. Again, this tradeoff is the desired speaker distortion versus. 


the undesired speaker attenuation. 


The final primary consideration, the transition width, was related 
to the first two considerations already mentioned. The transition 
bandwidth can not be made sufficiently low without ¡increasing the or- 
der of the filter. As the transition width is decreased without chang- 
ing the filter order, the ripples in the pass and stop bands increase. 
Therefore, the limit for the transition width is related to the maxi- 


mum ripple that can be tolerated. 


From the prototype filters that were designed the transition 


widths were relatively large due to the limitations in the filter order. 


After the coefficients from the prototype high pass filter had 
been calculated, these coefficients could be used in The same programs 


as used in the comb filtering implementation. 
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6.4 Adaptive System 


The adaptive filtering systems were implemented much in the same 


manner as those systems mentioned in the previous section. 


Since the methods used to implement the system that corrected for 
the overload problem encompassed the methods used for the system that 


did not, only the former system's description will be covered. 


The initialization was the same as in the previous description, 
but an array was used to hold the information on the spacings be- 
tween the coefficients since this system did not use uniformly spaced 
coefficients. Initially all spacings were set equal to the first 
voiced pitch period. As the filter moved through the input data, and 
pitch marks were detected, the spacing array was continually updated 
to reflect the correct spacing for each coefficient. If an unvoiced 
area were reached, the scheme was to perform the inverse operation 
described above. The coefficient that moved into the unvoiced area 
would retain the last voiced pitch period spacing, and the filter 
continued in this manner until all coefficients were clear of the 
voiced area. In unvoiced areas the input was attenuated as in the 


previous system. 


Another array was used to correct for the overload problem. This 
array contained the initial starting locations in the buffer after the 
filter had been changed. If a coefficient were about to enter an area 
where oVerloading would occur, that coefficient was set To zero, and 


the rest of the coefficients were rescaled so that the output would 
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have a constant level. The array that contained the starting locations 


was used to detect when an overload situation existed. 


The rest of the system involved only bookkeeping operations that 
were needed to handle the rules used by the system. The system was a 
little slower than the one proposed by Shields and operated on the 


order of ninety times real time. 


In summary it should be pointed out that the fortran implementa- 
tions were not designed for speed. Their basic design philosophy 
was one of user interaction with the programs to ensure correct pro- 
cessing. If speed had been a factor in this implementation, several 
items could have been changed. First, the input waveform could be 
composed of a single channel, and since the data samples were only 
12-bits long, the higher order bits could be used for the pitch mark 
information. With this change in effect longer segments of data could 
be stored in core memory, eliminating several input/output operations. 
The filter implementation could take into account the even symmetry 
of the FIR filter and eliminate one half of the multiplies per output 
point. The addition of assembly language subroutines to do the calcu- 
lational aspects of the program may also speed up the programming. 
With the implementation of these changes, the systém begins To lose 
the user interaction capabilities that have proven to be very helpful 


This work. 
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CHAPTER VI I 
SPEECH WAVEFORM RESULTS 


In order to determine whether or not the systems would perform as 
planned on real speech waveforms, several sentences were processed by 
The various systems. A true evaluation of the performance of the 


Three systems involves listening to the processed outputs when speech 


Signals are used. These systems will be tested in extensive listening 
tests at a later time, but from informal listening results and spectro- 
graphic analysis will be discussed in this chapter. It should be 


pointed out that it is very difficult to describe the results of the 
various systems with speech waveform inputs. The descriptions and 
comparisons of the systems will be made with characteristic words that 


are not quantitative in the least. 


The initial parameters used in processing the speech waveforms 
originated from the information provided in Shields! thesis. Shields 
concluded that the optimal value for the number of coefficients was 
seven. This number of coefficients provided the best compromise for 
the tradeoff between desired speaker distortion and undesired speaker 
separation. He stated that the Blackman Window provided the best set 
of coefficients from the window functions used, but he also stated 
that the differences in the results between the Blackman ana Hamming 


54 
Windows were slight. With these facts in mind, these parameters 
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could remain fixed while the enhancement systems were varied. 


Before the results of the tests similar to those in Chapter V are 
discussed, a few phenomena characteristic to the individual systems 


will be introduced. 


7.1 Comb and Adaptive Filter Results with Speech Waveforms 


There were several phenomena observed in the different systems 
That were not noticed in the test signal section. In The attenuated 
input method of Shields! system, an attenuation constant that was too 
small caused the output waveform to appear chopped. This chopping 
sensation resulted from unvoiced sections that were being attenuated 
by a great amount, and this caused a drastic change in the amplitude 
between a voiced and an unvoiced section. At attenuation constant of 
0.3 seemed to be the smallest value for the constant that allowed the 


chopping to be imperceptible. 


The inertial filter implementation of Shields! system did not рго- 
duce the chopping sensation, but a soft reverberation was introduced 
which was audible in quiet. This reverberation could be directly attri- 
buted to the fact that unvoiced sections were processed by the comb 
filter with constant parameters. Both the chopping sensation and the 
soft reverberation were phenomena that could be attributed To the 
unvoiced speech segments. Although the time intervals for unvoiced 
sections in normal speech are small, the phenomena Occurring IR ese 
unvoiced sections seemed to carry over into the voiced sections and 


disguise the actual results for the voiced sections. 
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The same types of tests performed on the test signals in Chapter V 
were implemented on the actual speech waveforms. These tests were eva- 
luated by informal listening and by spectrograms. Spectrograms will be 
included as a final comparison on the methods that were implemented. 
In these Tests the two systems implemented for Shields' method were 
equivalent since the desired speaker consisted of a totally voiced 


waveform. 


For the input pass test all systems performed equally well for the 
most part. Тһе most noticeable characteristic of Shields' system was 
a slight hoarseness in several words. The adaptive systems were char- 
acterized by a slight reverberation. All output waveforms had very 
similar spectrograms with only very minor differences from the spectro- 


gram of Тће input sentence. 


The input reject test seemed to show that the adaptive systems 
provided about the same attenuation of the undesired speaker, but the 
outputs from the adaptive systems had more reverberation than those 


from Shields' system. 


In the white noise test Shields! system seemed to perform a little 
better. Тһе "гееду" sound mentioned in Shields' thesis ” was present 
in the output. In the adaptive systems the output had a "buzzing" 
sound that seemed to be more annoying than the distortion in the output 


from the comb filter. Again the spectrograms indicate no major differ- 


ЕЙСЕс for this test. 


The combined signal test has been presented with spectrograms, and 
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the combined signal spectrogram is shown in figure 7-1 (a). The two 
waveforms were taken from two different male speakers with approximately 
the same average pitch. The two signals were aligned in time so that 
both speakers would be talking at the same time. This alignment would 
show how well the filtering performed when both speakers were talking. 
Figures 7-1 (b) and (c) show the spectrograms of the two waveforms 
before they were added, and the spectrogram in figure 7-1 (b) has been 
designated as the desired speaker. Figure 7-1 (d) is the spectrogram 
from the output waveform of Shields! system with seven coefficients, 
and a Hamming Window as the lowpass prototype. Finally, figure 7-1 (e) 
is a spectrogram of the adaptive system's output waveform. The output 
waveform in this figure was produced with seven coefficients and a 
Hamming Window as the lowpass prototype also. Both figures 7-1 (d) and 
7-1 (e) are very similar with each one having only minor differences 
from the other. This type of result was normally encountered in other 


sets of sentences that were examined. 


As mentioned earlier in this section, the adaptive systems pro- 
duced a characteristic buzz. The origin of this problem was investi- 
gated by examining the input and output waveforms in the areas where 
the buzzing sound was most prominent. These waveforms are shown in 
figure 7-2. In the output waveform it can be observed that the major 
peak in the input waveform is being clipped and distorted by the 
adaptive system. This phenomenon was not observed in Chapter V when 
working with test signals. In the case of the test signals, the pitch 
period was allowed to change very rapidly in some areas, and the 


adaptive filter performed satisfactorily. It should be pointed out 
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that in the case of the test signals, the impulse response of the 
vocal tract did not change. Therefore, the distortion that is shown 
in figure 7-2 (b) may be due to the increased variation of the impulse 
response in short intervals of time. This phenomenon, fortunately, 
does not occur frequently in norma! speech, but it may occur often 


enough to be an annoying problem. 


7.2 Notch Filter Results with Speech Waveforms 


The notch filters were implemented to reject the unwanted speaker. 
Several sentences were processed, and the overall opinion of the notch 
filter was that it did not improve the quality of the output waveforms. 
The notch filter was implemented by itself and in series with a comb 


Beer. the notch filter or series combination, at best, performed 
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only as well as the single comb filter by itself. In most cases the 
results of the notch filter or series combination of notch and comb 


were worse than those of the single comb. 


The distortion of the desired speaker was higher in the notch fil- 
ter implementation. This increase in distortion was probably due to 
the wide transistion width that resulted from the design specifications. 
The series combination also had the problem of "harmonic overlap". 
The "harmonic overlap" problem may be explained in the following manner: 
If the teeth of the comb filter are set to pass the desired speaker 
and the notches of the notch filter are set to attenuate the unwanted 
Speaker, there may be some areas in the frequency domain when the 
teeth and notches will overlap. This problem can occur even when the 
pitch of the two voices are very different. The overall result of 
this combination of notch and comb has to be lower in performance when 
the "harmonic overlap" occurs. Because of the results from several 
processed sentences the notch filter implementation was not pursued 


mariner. 


In this chapter the descriptions of the speech enhancement systems 
that were implemented were discussed. This chapter was included for 
the purposes of showing that the systems did in fact provide some type 
of speech enhancement. These systems will be tested in another effort, 
and at that time, some more analytical results may be revealed. A 
demonstration tape for all of the tests described in this section along 
with other sentences was made and may be found in the Digital Signal 


Processing Group Library, M. l. T.. 
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CHAPTER VIII 


RESULTS AND OBSERVATIONS 


The next two chapters will summarize the results and conclusions 
of This thesis. Chapter VIII will deal with the overall results while 
Chapter IX will cover the opinions and general conclusions. 


The general goals of this thesis were fulfilled. The system 
that was described by Shields has been fully implemented on the com- 
puter and was tested thoroughly for correct behavior. As an alterna- 
tive to the comb filtering methods, an adaptive system, has been 
proposed, implemented, and tested. A limited amount of comparison was 


performed with test signals and actual speech waveforms. 


The adaptive methods can be summarized in the following manner: 
The method originated from the time-varying structure of voice pitch. 
The filter was modified to conform to the variations in pitch in or- 
der to pass the desired waveform with as little distortion as possible. 
It was shown in Chapter V that the adaptive system was able to pass 
a time-varying speech-like waveform when the impulse response of the 
vocal tract was constant. From this viewpoint, it was shown that the 
adaptive methods surpassed the methods formulated by Shields. On the 
other hand, it was stated in Chapter VII that the adaptive systems had 
some problems when the impulse response varied rapidly in short inter- 


vals of time. I+ should also be pointed out that the adaptive system 
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was more complex and more time-consuming in the computer implementa- 


tion than the system proposed by Shields. 


The adaptive system stressed fidelity of the desired speaker, and 


apparently does no worse on the rejection of the unwanted waveform. 


Finally, it was observed that the notch filter and the notch and 
comb filters operating in series did not provide any substantial im- 
provements in the overall quality of the output waveform. |n fact, 


both of these methods showed substantial degradation in some cases. 
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CHAPTER IX 


CONCLUSIONS 


In concluding this discussion the following items should be 
stressed: First, the method proposed in this thesis, the adaptive 
filter, works well in some cases in the speech enhancement problem. 

In other cases, the method has its limitations. The good and bad 
points of the adaptive system were summarized in Chapter VIII. There 
is little doubt that the adaptive filter reduces the level of the un- 
desired speaker in all! cases. The desired waveform was sometimes dis- 


torted and occasionally unintelligible. 


Second, the method used for pitch detection as developed in Chap- 
Ter IIl appears to be a very good approach for the pitch detection pro- 
blem. The general pitch detection problem was not considered in this 
thesis, but the entire field of automatic pitch detection algorithms 
has been the topic of much research for many years. This method may 
possibly be improved by first taking the derivative of the glottal 
waveform, and then, using the peak picking algorithm on the resulting 
waveform. There is still one major drawback in this method: The 
transition areas between voiced and unvoiced speech are not easily 
irked whether automatic or manual processing is used. There is a 
general lack of structure in these areas, and a decision has to be 


made as to whether or not a mark is needed and where the mark should 
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be placed. This method of pitch detection also proved to be proficient 
in the automatic detection of silent areas. Although the maximum bene- 
fit of knowing where these were located was not taken advantage of, the 
silent area detection helped speed up the processing at the beginning 
and ending of the sentences. Overall, this method was generally accur- 
ate and swift. The feasibility of the method may be in question due 
to the fact that the glottal waveform is not as accessible as the 


speech waveform. 


Third, it can be concluded that the notch filter used to reject 
The unwanted speaker does not provide substantial overall! improvement 
when both attenuation of undesired signals and faithful reproduction 
of the desired signals are considered. This conclusion is not sur- 
prising, because the notch filter is basically the dual of the comb 
filter. Based on these observations, the notch filter should not 
be used because it introduces the serious distortion of the desired 
waveform. The conclusion is that it seems more feasible to pass the 
desired speaker with as little distortion as possible than to atten- 


uate the unwanted speaker to a higher degree. 


A major observation that should be examined is the question of 
how effective these methods are in speech enhancement. Ift was con- 
cluded that for normally mixed speech waveforms, there is a limit To 
the amount of enhancement that can be achieved with either of the 
filtering methods discussed. IT is believed that even optimal ly 
designed filters for specific speakers or even for specific sentences, 


would only provide limited improvement because of inherent overlap 





USO 


in the signal spectra. 


The remaining areas open for improvement in this type of approach 
to the problem seem to be in the area of polishing the adaptive system 
so That it approaches this limit. The difficulty in this area lies 
with the fact that much more information concerning the speech wave- 
form will have to be revealed before the techniques can be improved. 
Until that time, alternative methods should be examined for the pro- 


blem of speech enhancement. 
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APPENDIX 
ИИ introduction 


This appendix contains the actual computer programs that were 
used in implementing the systems described in this thesis. They were 
included as an appendix for several reasons. First, if these systems 
need to be implemented again, these programs will provide a reasonable 
starting point. Second, if these systems are to be compared with other 
systems, there will be no question as to whether or not the systems 
described in this thesis have been correctly implemented. Finally, 
These programs were placed in the thesis instead of a separate note- 


book in order that the two sections would not become separated. 


These programs were written in RT-11 Fortran, and all non stand- 
ard fortran routines will be included for completeness. IT should also 
be mentioned that some of the features of RT-11 Fortran may not Trans- 
late directly into statements usable with another Fortran compiler, 
but for the most part, the RT-11 Fortran is compatible with other 


versions. 


The PDP-11 structure stresses the use of modularity in program- 
ming, and this concept has been used to a great extent in These 
implementations. For each module or program, a brief flowchart is 
included with the computer code. This will probably be a helpful fea- 


ture if the programs in this appendix are to be dissected. 
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Figure A-1 shows a general layout of the overall systems with the 
modules shown as blocks and the computer file names that are used as 
inputs and outputs to these blocks. Figure A-1 is used in the pro- 

cessing of waveforms exactly as shown if there are no pitch errors to 

be corrected. However, if pitch errors occur, then these errors have 


to be corrected as shown in figure A-2. 


These computer programs were not written with speed of execution 


as a primary concern, but with user interaction capability and compre- 


hension for new users as the primary concerns. 
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THE PROGRrM LOOKS FOR A ZERO CROSSING IMMEDIATELY 


Ae VER A RIFFFR SITET HAS BEEN PERFORMED, 


ПАРА А А Ра USE TO DENOTE A SILENT AREA 
AND yHEN SET (=1) A PITCH PERIOD IN THE PITCH 


TET EI ENTE NEGATIVE SIGN TO DENOTE SILENCE. 


fee) tT ALIZATION 


ІМІРСЕК Плгвиыс (256605), Л(1259)«Б(1280), ВОҒ(2560) 


Or EEUN Sia ZLAC, STARI; TH., END] 


COON GAY BUF 





pn í) 


CEILL 


2n 


ee IND 


sioe 
[ATA DUF«TxXJe«START/ZZ2F G0*0«141417/ 


CATA IVAR ЈУЛН О КУАК, ТК + СЕ МБ ЈР доб/2 ,2+,2+1%0%0/ 
ПА Бо 16М (2. *RF1:PTICH,SSS1) 

G [ASS ICON (23, RIFF IT TER. ТРЛР) 

CALL ASSIGN (U4 ' KK1; PITCH, DAT?) 

eet beet 2 (2519595504. IVAR) 

ПО У ЈОНА ТЕ 2 (3000.724U«JVA^R) 

BEEN DESEJUE d (251. 256» U KVAR] 

НИ ле осе) СС TO 300 IDO WE QUIT? 

y=] INC, RESEY THE PUINTER. 

THE BUFFER 

(Оде = 16 

О АЕ ((ЛТЕПТ-(.) % (СТІТ-1) ж 256))%,151%256) 
CONTINUE 

THE PEAKS ann ZERO (FOSSINGS, 

CALL PEKS (A,BeIVAR,SUM) 


ЕГЕП ЕПОС (лв) 


eee ECU AIF PITCH 


COIS THE OVvERFLOW FLAG SET, 


THIS 


ПА 0) GO TO 100 

SECTION PICKS UP THE ZERO CROSSING FOR AN OVERFLON. 
uc] TES, FIM THE FIRST ZERO. CROSSING, 
ЈЕ(САСОЈ)„Ме«р) 60 TO 70 

ЧЕ =) 

ШОШО Г пар) ос То 250 


(SUME 





71 


“0 


lun 


Bn 


тер 


m Tu 
EENI ESTART 
Soe =e 1 


INDEX =2,. 


ЕӘР(ІМОЕХ) - 4995 


PUNTI LM / OG 


Pet ee OA). Os TNE KU = KU ~ 1 


а СТАН 
ае е ШИ е ин. 
Mee CG era, 0) 
Ко = =) + M IVES, 
Zu COE DOMI EAR 
VRITECS*UVAR) Me 
p 

~FLAG=0 

cO TO 230 
ПАС рат GO 
TH = SUM (IR) 
ШЕ ТЕГЕ. 0) 
ЕЛЕ = 1 

GO TO 200 

Piet 201) LT. TR) 
ez) 
ОИЕ а) 


„zJ+1 


ШТ Ont) CU 


ICALCULATE 


cO TC 200 


CO Tc 180 


-160- 

(EGUNO ITs 
О И ET EE PITCH PERIOD. 
ІРЕЗЕТ THE STLERT РОЈМТЕК, 
POITJOUN, 

! MARK THF PITCH FPOCH, 
ITIS SECIION DETERMINES THE 
!IREC« NO. THAT 


ICCPRESPONDS TO THE SPEECH FILE, 


ІКІС = 251 
AFC OURS IrIS A SILENT AREA? 
CHANGE PI/CH PERIOD TO A NEG. VALUE. 
Ttt SILENT FLAG. 
IRLC 


!GLFAR THE OVERFLOW FLAG. 


TO 250 !NO, APE WE THRU? 


Of TAIN THRESHOLD, 


GO IC 120 


INO, IS THE PEAK VALUE< TH? 


(NG, GO CHECK ZERO CROSSINGS, 
17tRO CKOSSIMG? 
INC + BUMP THE COUNTER. 


ШЕТ PORU BOUFFER? 





24 


190 


250 


Е ТОЛЕ 


SED NSTARTIQ4950 "LE ¿STORE START LOCATION. 


LFLAG=]1 [SET THE OVERFLOW FLAG, 

СО TO 260 'GG, READ ІНЕ BUFFER, 

ANOT: zJ E GT CL OSSING OCCURRED, 
pcm START OCU TTC TT PITCH 

INDEX=2 * J IC/ı CULATL POSITION FÜR MARK, 


PUF (INOFX) = ОЭБ КК THe PIICH EPOCH. 

WU Ce INDEX/Zomnm6 rene lore tant. NUMEE R 

TR (ei 255, .,ER.IFDL X) KU = KU = 1 

Wa Osment War UO, SPEECH WAVEFORM, 
IEC sd) Inne = 251 

ПИТА А TECO OI GCE TE 190 TIS THIS A SILENT AREA? 
(= <j 4 М TESS DENOTE 1TH NEG. VALUE, 
7ZFLAC = Q 


MITES у wa TREC 


START = Емо] IRESET TH£ START POyYyNIER, 
J = I + 290 IR; SET JY 3Y MINIMUM PITCH. 
со то 230 

J=I+] 

hore 1/956 


ШТ есу ГЕ» 1) IR = IR + 1 
CU T0 160 


TF ¢JFLEAG.EG.0) SIAFT=SSTART-1289 


C URITE Out THE BUEFEK OIC THE FAiLF Om THE DISK, 


МКІТЕСІ"Кулр? (RUF(IC + (CULA 71)*256) )  IDz14256) 





ern 


CONTINUE 


IE THE RUEFER, BUF, FCR THE NEYT CALL, 


ane 


МОРЕ СЕТ 60 
PPF (TC) =,D 
CONTINUE 

CU TOS 

ENDFILE 2 

t NDFTLE 5 


FNOFTLE 4 


сле 





ТО 


Den 


Initialization 


Caleulate= energy 


Compute TH 
TH ОМ ) 


Zero array 
Е 


Mark location 


of peaks in 
array F 





FIGURE A-6 
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UIT: Реке FOR 750110 


THIS SUZPOUTINE FINDS Thp PEAKS OF A WAVFFORM THAT HAVE 
EN STORTO IN A RUFFER ¿FAIBOUF), THE RETURN CONTAINS AN 
ARRAY "Fu WITH THF VALUE OF THE PFAK IF ONE OCCURRED ANU 
ERS GREATFR THAN ZEIND. IF NO PtAK OCCURRED AT THAT 


ESUNI. HHPEN A ZERO 15 RETURNED. 


PROGRAM PARAMETERS: 
THE ERERGY SyMMpTION IN [O-LOOP 3 HAS A SCALE FACTOR THAT 
USED TO PREVENT OVERFLOW. CIN THIS CASE 100. IS USEU 


KI TIE SCALE FACTOR.) 


THE ARKAY "SuMu CONTAINS THE THRESHOLDS THAT 
WERE CALCULATED Erom THE ENERGY MEASUREMENT. THESE 
THRESFOLNS ARE RETURNED 10 THE “IAIN PROGRAM FOR USE IN 


EPNERMIWVTMG THE CORKFCI HEAKS, 


Reet RESPOLE SETTING That WAS LINKED TO A PARTICULAR 
MENS REVENT CF ENERGY WAS DETERMINED Ry Ex/MINATION OF 
BLUVERAL vVAVEFORMS AND PLCTTING tNFRCY VERSUS TRUE 
ШОЛ тл! pres, THIS AREF OF THE PRUGRAM MIGHT БЕ IM- 
ОМЕР SY A MORE METAIt Es SIURY OF THE DATA FOR THE 


ПРЕТИЛО 3sHRESHOLO МА! ЏЕ>, 





m 
~ $ 


~ 


ем 


ШІ 


en 


ео 


= 


PARAMETER "IFLaAG" JS USE, TO DENOTE A NEGATIVE 


ee, TFLAE = ı <ü NEGATIVE SLUPE 


IFLAG = 9 - POSITIVE SLOPE 


SUBROUTINE PFKS (Е. р. IVAR: SUM) 
INLTLALTZAITOAN 

NEL EHI BLFL2SEUNLELI28U),F 01280) ,SUMI5) 
COMMON FATRUF 


COCO EOD А = 1,5 


PERFORM ^M ENERGY SUMMATjCN OVEM ?56 POINTS OF THE 


CLOTTAL WAVEFORM, 


ПО С 11511,2 
SUMXZSUMX« CCCFEATBUE(CCIA-1)*512) 1B) -2048) /300) X2) 


CONTINUE 


mero ЕЛІТ OF PROG, ASSIGES A TRCSHOLD VALUE BASED ON 


BSEUENERGY SpMMATTON COhRFELTULD ABOVE. 


YFOSUMX,NE»«0) GO ТО 10 !SUMXZO MEANS SILENT AREA 


Thin = 0 

CO TO SS 

IF(SUMX - 12000) 15,4940 | 
TECSUMX = 6500) 20.254355 


nom бп.) 25.20.50 


“тч 220nn 


COTO SS 





Bt 


C FIND 


EAN 


150 


Ec 


I70 


OD 


Wiz = 2600 

OTO 55 

к =з 

60 10255 

IF (SUY =- 165000) 85450450 
IR = S00 

CO TO 35 

THR = 5800 

sun (JA), = THR 

CONTINUE 

THE PEAKS FOR THE SEGMENT OF DATA, 
Bem Uso t= 1.1200 

EF (KI) =0 

CONTINUE 

K=1 

ке = 1 

IFCPATBUF( KES) = FETEUFCK)) 134041504180 
JFLAG = 1 

cO FO 170 

kh = K t 2 

CO TO I120 

IFLAG = 0 

IB rs 2552.2 


Реке 2 


ПР оте РАТЕЈРСРУ со то 200 


ШТ ШЕЕТІСЕ0;1) G0 1C 205 








Cy 


Meet ee ВОРСОКУ 2 ТРОЈЕ). GO 10 205 


ed = SREB (Kee) IFOUND THE PEAK 


il 


Er AG 
CO TO 21/45 
IFLAG=O 
poc Ke. ч 
CONTINUE 
RE TURN 


END 
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E 


ARSS 










Initialization 
Zero array 
co. 


Set up IFLAG 
for initidi slope 







Markall zero 
crossıngs with 
negative slopes 


FIGURE A-7 





ГЕ 


OE 


ПАРАДЕ о Ок 750110 


MTS” SU3SROUTINE FINLS ALL ZERO CROSSINGS OF A WAVE- 
EU 7 .i5PED 3N A nÜFFER«. ТІР A CERO CROSSING OCCURS AT 
SOVE FOIMT« THAT POINT Is MARKED WITH A ONE IN THE ARRAY 


uch, IF NOT, A ZFRO IS LEFT IN THE ARRAY, 


FARALETERS USED: 
NIFLAC" TS USED IM THIS SURROUTINE TO DENOTE AN 
ТЕ ОЕ THE DATA THAT 15 NEGATIVE, 


J.E. IFLAG 


j DATA VALUE 1S NEGATIVE. 


TENG =° OATS VALUE 1S POSITIVE. 


SUSROUTINE ZFROS (C,[) 


ZUITIKETZAITTON 

INIEGER CArBııF (2566, ,Ct1280),D(1280) 
COMMON CATBUF 

Bez) =, 1280 

CETT) f= Q 

CONTINUE 


ive) 





Eo 


50 


C MARK 


40 


ot 
en 


p 


Ta 


RQ 


гап 
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IF(CATHUF(N) - 208€) 350440450 


IFLAG=O 

GO TO 60 

A ZERO CROSSTING 

Међу = 1 

ја РА 

W N | 

BEE 256509), CO TC 200 

(0 TO 25 

IFLAG = 1 

IF (CATRUE(N) - 2056) /0,40,RO 


МИНАС O OGOTO 72 


C(N2) = 1 
IFLAG = Q 
ћ = N + 2 


K2 = Ne +4 
JF (M.6rt.256n) FO T( 200 
GG TO 60 


Пе се ,1) бб ТС 75 


сусу =5 1 
IFLAG = 1 
EOS TS 
RE TURN 


PRO 
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ADDNS 


Initialization 
E file names 







20 
Fill buffer 1 


30 
ТА ЛА а И 2 


Bone 
were mower ee 
Result——buffer 3 


5/0, 
Write buffer 3ondisk 


Me M о 
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TITLES  ADONS.FOR /50120 


THTS FROGRAM ADDS TWO FILFS TOGETHER WITHOUT A SCALE 
FACIOke ІТ 15 wE^NÍ tO BE USED ON AN ODD - EVEN FORMATO 
(IE. ONF FILE орг FORMAT. THE UTHER FVEN). IF SWITCH(U) 
IS UP THF PROGRAM ALLOWS THE INPUT AND OUTPUT FILES TO 
PE NAWES, IF SWITCH to) JS DOWN, THEN THE PROGRAM 
ASSUPES THE DEFAULT NÁMES THAT ARL USED IN SERIES WITH 


ШИЕ СОСКЕ ЕТЕ SHiELDS SYSIFM. 


SURRCLTINFS USEH 


JSSwWI (SEF REVArp FOR EXpLANANTÍON. ) 


JMITIALIZAT IOP 
INTEGER DA1BUF(2560)+CATBUF (2560) +BATRUF (2560) 
ГАТА РУДЕ Јул о КУАК/2 + 2 ,2/ 
ПЕРЕЕ(ТУ1) 
EO3MAT(* ++ PROGRAM CONTROLS = ~ - SWITCH (0) *) 
WRITF (742) 
FORMATI? t4 *sWITCH(OC) DOWN = DEFAULT FILES USED?) 
WRITE (743) 
EGRPATOS ee INPUT = SIGNAL.SSS*) 

RITEC.) 


ООУ en AUT PUT = SIGNAL ДТ”) 





e 


3 


10 


4 


a). 


С 


prr 67.5) 
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EORMAT(S s. *cWITCH(o) UP =- 1/0 FILES MAY BE CHOSEN”) 


FAJSF PAUSE? 

LE O 6 

CALL ASSIGN (2,'RK1:S8IGNAL.SSS') 
CALL ASSIGN (24 *RÉ1: PIE CH, UAT*) 

CALI. ASSIGN (Us ' HKL:SIGNAL«DAT*) 
ACH Ton 


SD DESIT) 


pORMAT(* pr OF INPUT FILE <FILNAM.EXT>+) 


АКІТҒ(7,68) 

FORMATO? ter °) 

LA SSIGN (19, RKTSSIGNAL.S5S*,-14) 
ПИ о TON 2 RKI FITCH DAT?) 


WRITE (749) 


TO INC?’ mE CF CUIPUT FILE <FILNAM,EXT>”) 


NHRITE(7,8) 

(ALL ASSIGN (Uu, *HK1:SIGNAL.DAT' 214) 
ГЕРТМЕ FILF > (201,255. 0, ЧАВ) 
ШЕБІПІ ЕЦЕ ~ (298156. 0, JVAR) 


[DEFINE FILE u (252.256.U,KVAI) 


PROCESSING BEGINS HERE. 


OTITIS) GO 16 500 


O LATIN 


REAn(2*IVAR) (DATRUPR (JB + ((TA-1) ж 256)). ІБ21%256) 
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en CONTINUE 


wees СЕ је С 
KEAn(R*UVAR) (CATBULCID & CCIC-1) # 256))«1021+256) 
4n CONTINUE 
Meee TRE TWO TOGETHER, 
ГО UO Jf£f=1+255%. 2 
ATROF CJE DATBOF ITI FT CAIBUFE CIE) 
PATRUFC(CIE+1) = CATELF(IE+1) 
qe CONTINUE 
ШЕТКЕ THE OUTPUT 
ROSO IF = 1,10 


DITE (HKU) (BATEUF(IG + ((IF-1) * 256))416210256) 


An CONTINUE 
CO TO 11 
эсе РАВНЕ Не ITURN TIT GS OFFS 


ENOFILE 3 
FNOFTLE ü 
STOP 


END 





REVAMP 


Initialization 


Review previous 


РАДО РАЈЕ Бо Ке 





Accept file 
names 


300 


Quit 









Fill buffer 





Oddchannel 
transterred 


even value 


> 4052 









Set even value 
to zero 








Calculate pitch 


Write buffer 


onto disk 


FIGURE A-9 
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ТІНЕ: REvAwP @, FOR 750130 


WBROLTIINES USED: 
ISSW) - THIS SUBROUTINE 18 uRIT!EN IN ASSEMBLY 
LANGUAGE AND ALLOWS FOR ТРЕ DISPLAY REGISTER SWITCHES TU 


РЕ USED AS PROGRAM CONTKCULS. 


WOR SWITCH (1) UP 


INSI PANTY OUTPUt PILES MAY BE SPECIFIED. 


mom StL IIcH (a) DOWN 
ШЕТ = SIGNAL OAS 


CUTPUIS =- 1. Co MANDO г PITCH: TRB. 


THIS PROCRA“Y IS USEL 1C REVAMP A SIGNAL FILE THAT HAS 

FEEN EDITED. THE OUD VALUES ОҒ THE SIGNAL ARE PRESERVEU 
ІМ TITS TRANSFORVATION. TRE EVEN VALUFS OR PITCH MARKS ARE 
ZEROES JMLESS THEY ARE СЕ ТО 4075. АТ THE SAME TIME 
INES TANCE BETJEEN PITCH MARKS IS MEASURED AND 


EXOREE Jw PITCH.T^B FOR LSAGE IN OTHER PROGRAMS, 


INITIALIZATION 
INTEGER DAIBUF (2560) eBATBUF (2560) +STARI.A(256) 
НИ о И стилу с теме 0/ 


LATA ІУЛК,ДУЛРАКУГК/2,2.2/ 





C 


^3 


"Т 


& 
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REVIER PREVIOUS PTTCH.TAR FILE 


CALI ASSIGN (94 *KF1:PTICH, ! AR* ) 
LEFTEE FILE 2 (1000, Pr Ur VAR) 
RET (2 JVAR END = 2) М. IREC 
ie ee a Oye INES 271 

cO TO 1 


БІМШЕТІСЕ 2 


NOW REINJTIALIZE, 


WRITE (745) 

A ISA ISE DI DOWN = DEFAULT FILES USEN.*) 
WRITF (744) 

FORMATO? *, * TNPUT - SIGNAL.DAS: OUTPUT =- SIGNAL, DAI?) 
KWRITF (745) 

КО (” ТҮТЗСИТІСІІ0) Up - 17/0 FILES "AY BE CHOSEN?) 
FAUSE 'PAUSE' 

ЦИА Оре 1) (С TO 6 

ИНЕП ERSSTEN (2,t«*K]: STGNAL.«OAS*) 

CALL ASSIGN (2,  HK1: SIGNAL.DAT*) 

CARE ASSIGN (4 'KKI:PITCH. TAR?) 

COTTO 10 

CRITE (767) 

FORMAT (' МАМЕ Ср INPUT FILE <FILNAM,EXT>*) 

WRITE (74h) 

Dm SUC °) 

CAL ASSIGN (oct 2: SIGNAL. DAS - 18) 


ККТІР (7.49) 
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9 FORMAT (* "ҚҰМАРЫ Ор OUTPUI FILE €FILMAM,EXT»*) 


KRITE C7 ae) 
CALL ASSIGN (3+"*RKIsSTGNALeDAT® , -11) 
CAL ASSIGN оре "БОКЕ РАТОВАЊА“) 
10 Пи А ЈЕ РС Der 251, 2565, U, Ivan) 
CEFINE FILE 7Z (2921 265,0, ЈУАН) 
[EFTNE FILE & (1000,2. Ое КУАК) 
„VAR = 2 
Perret HE THRU? 
11 ПОН А О 252) СО TO $00 
НИКЕ | {HE РОРРГЕЗ WTIIH D/IE FROM INPUT, 
UM "uS 3.10 
ОУС МАРТ (UAIBUF (CJS ж ((1І-1)ж256))%:1%256) 
2f CONTINUE 
Meee NGFEX THE ODD SAMPLES AS THEY ARL, TRANSFER THE FVEN 
C SAMPLFS AS ZERO UMLESS TEEY EQUAL 4095. ALSO THE 
С ГЇ1ЇТСН.ТАВ FILE ıS BEING WRITTEN OUT IN THIS LOOP, 
ИОС ПИСЕ 1,2589, 2 
РАТОЈР (71) - DATBUF (71) 
ПИ пЕШЕСт КЕ СЕ. Н075) Бо TO 25 
FATFUF(IJI1 + 1) = 0 
CO TO SC 
po РАТРЏЕ(ЈЈ + 1) = 4055 
Wes (C1141) = СТАҒТ)/2 
Bee о E 11/2596 


ПЕ БА) roe fo +1)) JU = JU «1 
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МИ Ба ЈЕР + ои! Сл соба те КЕС, NO. 


C CHECK FOR SILENE ARLAS 
ШЕ е Ете ооо 60 TU 40 
ПИ ИЕС ОУ СЕТО 40 
ИҰ 2M o* -1 IDEEN IS A SILENT ONE, 
а «КІТҒ (4 "куде) Ма ОБРС 
STADT = 11 + 1 
í! CONTINUE 
TE THE OUTPUT INTO SICFAL.DAT FILE 
ЕТПЕ SNB 
„RITE(3JVvARY) (RATBLF(I3 * ((12-1)*258))4 I5z14256) 
10 CONTINUE 
SET START 
START = START = 256¢ 
CO TO 13 
30n ENDEILE 2 'TURN TRIACS OFF. 
ЕШОЕТГЕ 3 
ENDFILE 4 
STOP 


ENO 
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PA TE EDO 


DOCUMENTATION FOR SHVO3 AnD SHVOH 


АКЕТКАСТ 


THESE PROGRAMS IMPLEMENT ІНЕ SYSTEMS DESIGNED AND 
ШҮГІЛТЕІ) PY VADE" SHIELCS IN AN MIT MASTEH'S THESIS FROM 
ES. Ihe PURPOSE OF THESL SYSTE»S WFS TO SEPARATE A SIGNAL 
FROM NOISE (USUALLY ANOTHER SPEAKER). THERE ARE TWO SYSTEMS 
MENENIE, AND THESE DIFEER AN THE MANNER OF UNVOICED 
SPFFCH PROCESSING, SHVOz USES THE ATTENUATED INPUT METHOD 


VEJLE SEVO4 USES THE ¿NERT¡ PL FILIERING METHOD, 


SURROUTINES USED 


1. WINDOW ІМТҮРЕ.К) - THIS SUBROUTINE ALLOWS FOR THE 


SER TG PRFSCRIRE THE WINCC FUNCITON DESIREN IN THE PROGRAM 


TO FESTGNATE tHe VALUE CF K FUR THE FILTER LENGTH. 


=. ОЦИ а A у THIS SURROUTINE CALCULATES THE 
MF FICIENTS OF THE FiITLR PRESCRIBED ІМ THE WINDOW 
AA ROUTINE AND STORFS THESE VALUES IN AN ARRAY "A" THAT CAN 


KE OF MAXIMUM DIMENSION GF 15. 


OTPER PROGRAMS USED JN CONQUNCTION WITH SHIELD, 
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T "RFS^"p - THIS PROGEA« SAMPLES THE ANALOG WAyEFURM 
AT 10 kHZ, ON FACH OF @ CHANNELS (4 AND 18) AND PLACES ATS 
VUTPUT CN FILE STGNAL,YXX , THE CHANNELS ARE STORED ІМ 
INTERLFAVING FORMAT, 
ILIBINNF-L- tg) - nDD SAU, TE VALUES. (ODD FORMAT) 


mur (10) 2 EVFN SAMPLE VALUES, (EVEN FORMAT) 


2. SFPAR =- THIS PHROGEARM SEPARATES THE TWO CHANNELS ON 
ENECFILF AMD PLACES EACH Ct, A SEPARATE FILE, 

INFLT = SIGNAL. XXX 

DUTPITS = CHANNEL (6) - SIGNAL«.SSS 


CHANNEL (14) -~ PITCH.SSS 


BOTH APE ASSIGNED THE FORMAT OF A CHANNEL (4) ONLY 
mt. ШЕШЕ; ODD SAMPLES = DATA? EVEN SAMPLES . 0 BY 
MULT.) IF OFSTRER THE S: COND OUTPUT CAM pE PUT JNTO EVEN 


FOP AT, 


Die J U00202 72 THIS | РООКАМ DETECTS PITCH FROM IHE 
ЕСЕН. 555  FILr. THERE ‚RE TWO OUTPUT FILES FROM THIS 
C TAI FIRST. PITCH,'!^E IS A TaßgliLAR FILE OF THE VALUES 
EL. Ti EMI TOU пето по о О As AN INPUT TO SHIELD, 
ENXCOND, PJIJTCP«DAT IS A FILE EGET СЕП” 10 MARK PIECH 
E ue ШОЛ AVE ERRE Elli. PITLH DAT HAS TWO VALUES ONLY» 


ШІ ОН 4095), THF VALUE 4(ЦИсе 15 USED TO DENOTE AN ENDPOINT 








NS 
ШАЛ FITCH PERIOF, THIS IS USED AS AN INPUT TO ADDNS, 


Euch, PDT UTS TINE YEN FORE AT. 


4. ANDNS = THIS PR(CRAM ADNS TOGETHER TWO FILES, 
SICNAL.YXX AND PyTcH.U^l. SINCE UNE FILE» SIGNAL«XXXr IS IN 
Ог РОМАТ AND PITCH.DAT IS IN EVEN, IPF CHANNELS ARE KEPT 


SEPARATE BUT PUT INTO ONF p ILE CALLED SIGNAL. DAT, 


ЕРОТ TO SHIELD 
1. SIGNAI SI = FROM ADDONS 


2. РІТСН.ТАР - FROM PLIVO? 


OUTPUTS FROM SHIFLO 


1. OUTPUT.LAI 


At GOR ITEM 


ШЕ ТГЕ PROGK AY FIRST PROVIDES TER USER WITH THE CHOICE 
DE TYPE OF WINDOW APD THE PARAMETER Ke CK TT COEFFICIENTS 


ARF USET In THp FILTER): 


Ca ME ОЛКО ЛАШУ THE COEFFICIENTS- FOR 


MP FILIL AND STORES THEM JM ARRAY "A", 
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8. ІНЕ МЕХТ SeciTON ¡1LOWS FOR THE INITIALIZATION ОР 


The LOGICAL UNITS THAT WILL Bi USED IN THIS IMPLEMENTATION, 


m EO DORMIT > 


СОВЕТИ “SIGNAL FILE, 
ПИПИН АЦА АЈА 5 = сео EFORTTEE PIlICH FILE: 


C, LOGICAL UNTT & ~ LSED FOR THE OUTPUT FILE, 


NOTES THE INPUT AND CLTPUT SIGNAL FILES, ON LOGICAL 
ENDS 2» AND ug ARE DIVyILEL INTO RECORDS OF LENGTH 256, IHE 


EEUU rJEE IS DIVTIDET TINTE RECORDS OF LENGTH TWO, 


E DUFFER ІС IHNEN SET Up INITIALLY BY LEAVING Ik 
RECORDS FILL Ор ZEROS (IR IS COMPUTED IN THe FOLLOWING 
MANNER: (172 Or FHF WINCCY LENGTH) IN SAMPLES ж 17256 
(HFCORL/SAvpLES) + 1 Ift THE  NUFEER OF SAMPLES IS МОГ А 
frets: Or 256.) THEN THE SIGNAL IS TAKEN FROM THE FILE ANO 
PI-LMCED INTO THE FUFFER, THE NUMPFR OF RECORDS READ IS 


VAPIABLF OFPENDINGO ON THE FIRST yALUE OF PITCH AND ON K, 


5. I" THIS PROGRAM Tee VALUt T IS USED AS A BUFFER 
moire. TEET VA UE ШЕТТЕ НОТЕ СЕМТЕКН OF THE FILTER 
NET IS BEING USED. Jol SP O о РА ПАО НЕ THE FIRST -SAMPLE 


IN THE (ATA SECTIUN OF IhE FUFFER. 


6. THE Vali OF PITCI IS READ AND THROUGH THE SAME 
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E15 or MEASUREMENTS AS FOUND Im THF INITIALIZAITION 


PROCESS. JHE ALGORITHM CHECKS TO SEF IF THE EXTREME VALUES 


NEEDED sh THE pIFFERENCE ECUATION ARE WITHIN THE BUFFER, 


Т, ЕИ БЕСОТ ТЛ ЕГ ЕШ ЕЕЕ ЕН IS SHIFTED: THE VALUE 


ШЕН КЕГКОСТТІОІГЕП дМр MEL (ATA IS RFAD IN FORM THE FILF, 


8, NOW THe CALCULATIONS CAM. BE АЈА РОТОР. ІНЕ 
ВЕРЕЕКЕГСЕ EQUATloOM IS Jhi8LEMENTED AND AN OUTPUT POINT Y IS 


BERUF ATEU AND WRITTEN ONTC THE OUTPUT FILE. 


ere lle PROCESS CONTINUES UNTIL THE PITCH FLAG IS SET 
ALTING THE МАКК IN THE INPUT FILE. A NEW VALUF OF PI!CH 
ES DD IS THEN READ, AND TIEFE FILTER 1S MODIFIED FOR THIS 


CANGE, 


TO, A SLIDE OPEKATIG;, OCCURS WHEN THE VALUE OF I is 
BIST ALL TO ACCUMMOOATE ТҮР CURRENT FILTER. THE BUFFER IS 
PEPOSITIONED AND 4 PFSE! SC THAT ALL INPUT VALUES NEEDED BY 


ПН | ТЕ“ ARE PRESFNILY In THE BUFFER, 


MN P| oc SINGELSIITENBES URTTL A SILENI ARFA IS 
EN ll O O AND rHEM AFTER FILING THE OUTPUT FILE WITH 2ЕКО5 


RF PRUGRAM HALTS. 





mp 


eye Ww OS 








Initialization 





Accept CON, 
Attenuation 


CALL WINDOW 


CALC COEF 


20| READ M from 
PITCH.TAB 


50 
Take 
Measurements 


Fill buffer 
initially 


Clear PFLAG 





FIGURE A-10 





EOS 


Sia CS 


Clear ZFLAG 


READ M from 
PIS 


no es 


Set ZFLAG 













no Unvoiced es 
29 segment? 120 
Clear LARGE Set LARGE 
Take 125 
Measurements 
no e 






Finished 
buffer ? 





Assign 
GO TO 100 25 0 KXX 
GO TO 200 129 J 






buffer need es 


shifting? 
159 






@ Shift buffer 


@ 
EIGURE A-10 (CON'T. 





mU 






Slide buffer 





190 

Assign 140 to 

KXX 

200 

Set uplocation 

tor Output 

220 

no yes 








Implement 
difference equation 


result —Y 


Is 
ZFLAG set? es 





es 
OUTPUT SCONE OUTPUT: 2048 


INPUT 






Add zeros at 





beginning 


GUIOT: Y 


ШЕШЕ O СОМ Т.) 








2503 


S Rn Ae 


270 
Output tor 2nd 


спап: 2048 









Тите 
to write 
output? 







WRITE OUTPUT 
cnto disk 


End 







400 


EN 


Bump | 






Change 
puce 








Set PFLAG 









GO TO KXX(125, 
140) 





300 






Add zeros to 
end of buffer 





129 






Ооо 


FIGURE A-10 (CON'T) 
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TĪTIF: SHYOS mop OI an omer ER 


KO S IFDP t T К TS PY V, SHIELDS 


NS rFROCRA" UsEs THE WETEOD OF SHIFLDS To PERFORM 
THE SEFARATION OF A SPELLFER AND NOISF BY MEANS 
EN [LIoTT^AE Coria FILTER, THIS PROGRAM USES THE 


ПИТИ ЛЕТОМ wETHOn БОК Up VOICED SECTIONS, 


SEBRCLTIPTPS USED: 
WINDOL (MTYPF,K) 
oer (CUTYPET KY AL) 


АЕ 1205 USED: 


ARRAYS: 
1. ИСЕ ООВ THE INPUT SIAMELE VALUES, 
ЖЕРЛЕ ЕТКТТІГС ШИЕ CUOLFFIETENTS, 
Ze CUIPUT - TEMPOR„PY BUFFER FOR OUTPUT VALUES, 
EIArFLEzs: 


ПС ле SEN TO INITIALIZE OUTPUT FILE 
SO THFT STEP CISCONTINUITY DOES МОТ 
OCCUR, 
2. ZFLAG - uStN TG DENOT BEG. OF SENTENCE, 
КИНЕ С с р TO TP NOTE MER PITCH PERIOD, 
ШЕГЕР SEN TO PENOTE UNVOICED SECTIONS. 


Es CON = IHF ATIETL ATION CONSTANT FOR UNVOICED 





БЕСІГІ. 


-190- 
SECTIONS, 


MET INT CATION 


BINENSION A 15) 

INTEGER DATRUF(4096) + START +D0UTPUT (256) + Z Е(АС, БРЕ Аб 

UDINE SEEN. LARGE 

ГАТЛ ОАТРЦР, ері Лбі(0( ТРИТ/Ч8095%0,1.256%0/ 

E JAI VAB IVAR ома лече че“ 

I О АРА O SUDPEDIJNES=EDORPSIMITIALIZATION, 

‚RITF(7,1) | 

Poet t's VALUE CE AITEN: CONSTANT? S14«« F5.2»*) 

BEBEPT Oy) COT 

FORMAT(F3,2) 

CALL ASSIGN( 2, *RKISSTGNAL, DAT") 1OUTPUT FROM ADDONS. 

erie ASSTEN (2, PITCH TAB"; 3) "OUTPUT FROM PITDETO+ 

G PD S TENE, TOUTPLT" +16) OUTPUT OF THE COMB 
IFILTER. 

Bone FILE > (251.256. 0, КИЛЕ) 


[LFINE FILE 2 (1000,2.Uel1 VAR) 


С VINDCL OP TERMINATION. 


CACY PINRDOW (NITVPE +4) 


ПА С САТЕ COEFFICTENTS, 


CALL СОБЕ CNTYPE*F*AsL) 


г ЕРЕСІРТ THE LENGTH UF THe OUTPUI FILF. 


I UD. e251 + (Kh жг) + i 
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LEFTNE FILt nN ALIMI IS 255, Un JVAR) 
ILL THE PUFFER INITIALLY 
eee tHe FIRST Val UE FRUp THE PETCH TABLE FOR 
END TJIALI2ATICN "jJUPPOSES, 
20 ККА бе ут M 
KU J| í THIS VALUE TU CCM FNSATE FOK TWO CHAN, 
=IAanSt2rM) 
C JS M TOO L^3CE? 
MO "U.GP.uOg) GO TO 20 
ҮСТІ MAY SET Up BUFFER, TAKE "EASUREMENTS* 
Sn C-L-1 
BLZ + M) + 1 
IR=JV/256 
ПИРА aos NES) J]ik=IR + 1 
TT NIGER 
Mette eta ty ОК н) 
ПЕТИ. Ike PUFFER FROM WELRE THE POINTER STARTS, 
CO AN II=(—(2+#T1TR)+1 + 10 
EEG KVARI (DATIRUƏ r CU € CCII-T) *256)), Ј=1,256) 
en CONTINUE 
= eee Pitch PB FILES 
C roll paeGinm THE PROrTESSIMG, 


Eu Een SEES DREHEN «e CHANGE PITCH, 


tt 


100 FFLAC 


1 


Zac 0 
НА по ЈА У M 


Пи ТО 7FLAO = 1 
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У Аб СОМ) 


mS M TOO LARCE> 
I АНИ ОО 10-140 72500 CORRES, TO PITCH<50. 
ss TURN СЕР ЕЛ ТЕК. 
120 LARGF 7 .TRUF, 
106, ПОРТЫН 502 0” 1700 ТІ PFLAG SET? 
NE LAG IS MOI SrT, 
EE E ver TERDUGH WITH TPE tUFFER? 
J (J O iO) POO TC 127 
U TO, GC АГТОКМ AN CUTPUT POINT 
bSSTGM 325 TO КХХ 
(0 то 200 
(о LE ARE THRU PUFFER+* READ IN A NEW BUFFER, 
127 ПО РЁ = аб 
IF (KVAR,GEe252) GO 10 300 


KEAn(2*'kVAR) (OATPUF(ICB ж ((99-1) ж 256)) «ІСВт1%226) 


122 CONTINUE 
129 ) = 1 
СО ТО 125 


K LAS NOT TOO LARFE, PROCESS THE SIGMAL, 

коп LÄTCF = .FALSE, 

АКЕ PEASUREMENTS, 
ШО ( (C 72) uomo tIV=1/2 (LENGTH OF THE WINDUW) 
IJR=IV/2E6 
BI пора МЕ ФМУ к= ТК * 1 


IP=1R*256 








= Обе 
МИН РТЈСН Рблб 5:77 


Ju IF{PFLAG) vo TO 100 
C pO, (CES THE FUFFER FEED TO BE SHIFTED? 
150 ЈЕ ((I * JV),LT.U09« ) GO TU 160 
ES: SHIFT THE PUFFER 
ala =) 
KVARZKVAR-(0 ж TR) 
CO 158 JJ=1,16 
1F (KVAR.Gt.252) GO TO 500 IARE WE THRU? 
A Onin ( dun ((JJ—1)*256) ).J21+256; 
Шер CONTINUE 
Meee Ser THe POINTER I. 
ELO START = (16 -(2 * IM) * 256 
INS (TI O [D = START 
cO TO 190 
fms) 1 702 SMALL? 
160 DET GC со то 1¢¢ 
Bes, SLIDE THE BUFFER. 


170 Ор I 


.VEY 7 JVFX/256 

ЕО 0м. VEX} JVEY = JVEY + 4 

KVAR = (KVAR - 16) - JVEY 

LG 175 IDv z 1316 

READ(2'*KVAR) (UATPUEF( IDX * ((IpY-21) 2560)  IDXz21:256) 


175 CONTINUE 





zd 
T ESET I 


T = JHOLD + (256 ж JVEY) 
1°¢ ASSTCN 14n TO KXX 
BE OUPUTATTON AND DUTPUI SECTION 
enn INO ZT 
zn NE Се БЕЈ Оле“ 220 
JINO = IND - 256 
@0277042]10 
22 ЈЕ (LARGE) Go 10 26g ILARGE SET? 
O, CALCUL ATF AND OUTPUT POINT, 
Y=0,0 
ГО 240 ММ-1і,! - 1 
Ш СИТЕТ - Г(К Ж — MM) * M) 
Y=Y * (A(*wM) x DATELFCIPOINT)) 
م24‎ CONTINUE 
EE IS THE FIRST OUTPUT TO BE wRITIEM, 
ШЕР ЕС 02) “ОО 1€ 255 
Mires CENT ALLow PRASTIC STEP INPUT, 
ПИО ТО = 3, CINU=2 2 
(АНРИ Thu) = 2086 
CUTPUT (IU +.1) = O 
any CONTINUE 
SETO ТЕПЕРУЕР, 
РЖИЕГГТТІ I-4F OUTPUT POINT 4 
еке МОС = Y 


GO TO 270 





a0 


Е-Е 


270 


= 
ЕЕЕ ОС) GO 10 265 


СЕРИ мр) --2048 


ROTO 2770 


f 


GCUTPUTIND) = (CON + (DATBUF(I) ~ 2048)) + 2048 


CUTPUTCLTND 4 1) = O 


K L CE READY TO DIMP OUI UT? 


JFOY'!'DL4NE255) CO Ic 280 


C BES, WE DIEM 


VRITECS* JVAR,.END-ZUOG) COUTPUT((KKK),KKKX714256) 


LEO 1=1 + 1 
JF(nAT8ƏUF—(41),FG.80S:) PFLAG=I !CHANGE PITCH? 
I-l + 1 
220 СОТО (129,140) 
ЖЕНІС SECTION IS USEL TA p JLL CAIBUF WITH ZEROS WHEN 
ПА ЕС. 
500 ОООО АУЕ оте IDETERFINE HOW MANY 
ЕСЕ Е 256 
Peep (hx = СОЛОД 1) 6255) )=0 IRECORDS NEEU 
ie RE. 
505 CONTTNUE IAUDFD WITH ZEROS, 
S10 CONTINUE 
FARGE) GO TU 12797 
CO TO 159 
IES. GUIT 
Zu > ENDFILE 2 


A 3 









pus 
И Е Е A Ра i 
ENDFILE 4 





ШЕ 








C 
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TITLE? SHyvog AUTHOR: Re FRAZIER 


ADAPTFD FROM THESIS FY Ve. SHIELDS 


THIS PROGRAM USES The METHOD GF SHIELDS To PERFORM 
ft SEPARATION OF A SPEFEER AKO NOISE BY rE AWS 
EX! CL IGTTAL corlg FILTER, ПИРА ШОРА USES ТИЕ 


ATTERLATTON METHOD FOR UNVOICED SECTIONS, 


SÚBROLTIVNES USED: 
ENDO. (MTYPE,.K) 


COEF (NTYPEekeAeL) 


PARAMETERS USED: 
ARRAYS: 
EEE COUNT fT SalheE INPUT SAMPLE VALUES, 
ЕВ 7 - CONTAINS The COEFFICIENTS, 
РОТ С ТЕСРОПКАКТ BUFFER EOR OUTPUT VALUES, 
VARIAFLES: 
1» SFLAG - USED TO INITIALIZE OUTPUT FILE 
ORTA STEP FISCONTINYITY DOES NOT 
OCCUR., 
0.  ZFLAG - SEN TO TENOTE BE. OF SENTENCE, 
а. PFLAG = (ISLO TO [ENTOE NEW PIICH PERTOD, 


DEA RGE - (СЕП ТС ГЕМОТЕ UNVOICED SECTIONS, 





=108= 


C) 


( МА А РА ЛО 


ГІМГМ5ІСМ n(15) 
INTEGER DalBııf (4096, ,START,OUIPFUT(256) »ZFLAG,SFLAG 
LOSICAL PFLAr,LAKCE 
САТА PCATBUF, SFLAGYOLTPUT/4UD6*0,.14.256 «n7 
САТА ЈУДКо ЈУЛ И КМАК/ 2 + 2 + 27 
C CALL THE FOLLOWiNe SUBPCUTINES FOR INITIALIZATION, 
CALI ASSIGN(2,°PKI SIGNAL, VAT!) 1O0OUTPuyT FROM ADDONS, 
ОИ ТЕЕ (PETER, TAB Ys) OUTPUT FROM PITVUe. 
GADI ASSIGCNCU OU IPUTtU ҰБ) IOUTPUI OF THE COMR 
ıFILTER. 
ШЕРТЕР Е 2 (291,256, КҠУАК) 
БЕРКЕ FILE = (L000,2.Us IVAR) 
C WINDCh OEFTERMINATION, 
CALL WINDOW (MTYPE hy 
Bee eCUIL AIF COEFFICTENTS. 
ere ОРАО МЕР РЕН) 
( SPECIEY THE LENGTH UF THe OUTPUT FILF. 
WAS 251 4 (Ko -—y + 1 
ОШЕТ КЕ u Cl TIE14)256U0sJVAF ) 
ШЕРТЕР THE PUFFER INITIALLY 
BEES D RE FIRST VALUE FFOp THE PITCH TABLE FOR 
NEPNITIALIZAIIJION PURPOSES, 


гв PEAD S IVARI (| 





ГОЗ 


Sis М 


с NOW ы 


E 


ILL 


E 


C 


ОМ p 


топ 


ен. 


s 
f THIS VALUE TU COMPENSATE FOF TWU CHANNELS. 


М-Ідр5(2жм) 

n s S Бо VILL BE USED FOR UNVOICED, 
TOU EO 

ТЕ (М, БЕ, ч00) 60 T6 20 

Е MaY SET Up nUFFEH, TAKE "EASUREMENTS, 
C-L-1 

Руби жог) + 2 

INR=-IV/?56 

J ili o NERV IR =EIR 4 1 

IP-TR*?56x2 

IS(TP + 1) „ (к * B) 

THE RÜFFER FROM WPELE THE POINTER STARTS. 
CO 20 І1іс(2кТЕ)41%16 
el) Ж ((ГІІТ-7) x256))«J9214256) 
CONTINUE 


IDE I ss E IP IIC; IA FILE- 


BCIN THE PROCESSING, 

EIN EF Р. ТІРТ5 FLAG DEI, WHEN WE CHANGE 
ZFLAG = 6 

HOLD = M 


MESES TARO M 
и ETS 0) 7ELAG = 1 


P=JARS(pxrv) 
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ЕЕ M TOO LAKCE? 


IF (".b6rEr.unp) M = MF OLO 
C TAKE FEASUREMNENTS, 
Мз (С^?) £ M) + 1 IIVz3/2(LENGTH OF THE WINDUM) 


TR=TV/256 

J Пра ел је ТІ) ІСІН ғ 1 

LP=TR* 256 
EN tif PITCH FLAG SET? 
149 JECPFLAG).60 TO 100 
НИ LCES THE RUFEFR TEED TO ВЕ SHIFTED? 
15n nag ПО) CO TU GU 
NECS. SHIFT THE RUFFEF 

JHOLD= I 

KVARKVAR=(9 ж ІК) 

ЕС 2221116 

ПЕК ОКУЛ Ct, 252) CO 10 500 MARES E THRU? 


MES US UKUARY (СРАТБРЕСО + ((Jjj-1)+*256))  Jz1+256) 


о А CONTINUE 
ET I 
150 SUPISUMSUO*TINM)) *—256 


BI OLD-STÄART 

GO TO 206 
EE S I 100 SMALL? 
jen J (T GE Vv) PO TO cin 
FS., SLINE ITHE ВОРГЕЕР« 


170 о RST 





ADS 
O = JV T 


1 


VEY = JVF¥/256 
ПИ ОЕ ари ле Бе смех VEY = JVEY + 1 
VOR = nr KWNR « 16). JVEY 
ПО РАНИ yy = Део 
NS А о то Стен] уже об |) + 170Х=1+256) 
NIE CONTINUE 
ЖЕЕГ5ЕТ I 
TD MOND I POE ем) 
NEOOUPUTATITON AMD OUTPUT SECTION 
zn N | 
210 IF(IND.LE 258) GO TC 220 
ШІ) КУ = 256 
EOS 2170 
ШЕТ CALCULATE AND OUTPUT POINT, 
Quo p Beine ea, GO TC 253 
Y=0,0 
a ee AN 
OTT I JD и + 1 2 М) М) 
Ү=Ү + (/(ММу ж (DATLUFCIPOINT) = 2048)) 
EC CONTINUE 
ИШ CO TD;S ThE FIRST OUTPUT TO BE WRITTEN? 
253 J IOo [AG EO.0) CO 10 255 
C YES« LOVT ^ Low PRASTIC STEP TRANSIENT. 
poen NU = qa (INI —2)s 2 


Fee ll (LU) - 2085 
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ПИРА y 1) s-Q 
pou CONTINUE 
Seno sc FOREVER 
ZEE ITI" THF OUTPJT POAJl!l, 
255 INO ОЕШУ ОСОТ ТГ ТМ SS ІРТУТҮ + .5) + 2048 
КЛ ЛЕ ОСОКЕ OU] J IT (IND) = 2048 


270 JF (DATBUFCT % 7).60р.4075) ОПТРИТСІМІ + 1) = 4095 


1 
с> 


EPI Ен ва ОВО REIT PUTCIND #1) 
Mere WR READY TO pUMP OUTI UT? 
ОИЕ ЕЕ GU RIC. 250 
MES) WE ARE! 
AN (DUPPOT(FEKK) Í KKK=1+ 256) 
2۴ J=1 + 1 
IF(DATBUF(1j),EQ,u095) PFLAG=]1 CHANGE PITCH? 
J=I 4 1 
eS GO TO 140 
Mees SECTION ADDS ZERUS TO OATBUF WREN CALLED. 
300 TO IO TILA IS 
ЖОШО LAST Ab 


LATRUFLULAX 4 (CCOILAx -1) 255) ) 22048 


406 CONTINUE 
410 CONTINUE 
CO TO 159 


МЕ Se CUIT 
400 ENOFILE 2 


ERÖETLEE 3 
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те | 
Pe Cie) LE y 
СОЕ 


F [40 
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een 


Initialization 


Select 
100 " Fılter 650 









PARKS 


MCCLELLAN 





600 


HAMMING ARBITRARY 


RECTANGULAR 





BLACKMAN 


500 


Scale coefficients 


FIGURE A-11 





1F 


ج 


INES. COLFECEU® FIZOG 


THIS SuBPOUT' INE T^KES Th; vALUES THAT wtRE HEAD BY WINDOW 
ШІ СЛІСІТТЛІР5 THE COEFFICIENTS FOR THE APPROPRIATE 
WINDOW IHAT WAS RFQUFSTEL. DIESEN SEI HESEICOEFFICTEN TS 
TO The “AIN PROGRAM JM ARFAY "A", 
BEEINITINNS: 
То = ARRAY CONTAINING COEFFICIENTS. 
21% NTYPE = THe VALLE ГНАТ СОККЕЗРОМРЗ TO THE 
WINDOW OP FILTER DESIRED, 
Aa К = THe yALUVE TprT DETERMINFS THE LENGTH 
OF TIHE T PULSE K OF THE WINDOW 
AND THE MUMPER CF COEFRICIEMTS CALCULATED. 


це | =? *« K + 2 


SUBROUTINE COEF UNTYF ES Ke«A*L) 
ENS TON A(15) 


ШОЛ ОР 5. 


C1 


po 0.0 
(ОМТТМЈЕ 

dil 5. 11159 
CCE OC 
ШОИРИ ғо 


(0 ТС 1700.200. 500.400. 600 50), МТҮРЕ 
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С MANNIRG WINDOU SEIECTEN 


ШОО — pO 110 I = 1,L-1 
hep) 5-s«tcosqpcesppert)7zt)) 
CCIE SS SCALE + MIL) 
а а. CONTINUE 
CO TO 500 
C RAMMING WINDOW SEL ECTECe 
DI [posce ET 
Atl) = ,5qu - (.46%((С5((2жРІжІ)/І))) 
SONS SCALE + ACI) 
210 CONTINUE 
CO TO 500 
C PLACKFAN WINDOW SFLECTED 
son БИ О т = 1,1-1 
По он осе о i I Ze I + (,085x(CO0S( (aP 1# 
ТЛЕ | 
СЕ SCALF + NACI) 
510 CONTINUE 
CO TO 50C 
C PECTANGJL ^R vVINDOV SFLECTED, 
np ОШто 1=1.L=1 
ACI) = 1.0 
A SALA + ACI) 
41r CONTINUE 
MICO"RGTE THE SCALE FACTCR, 


0 DOC Ugo $4,422 
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All) = ALTIZSCALE 


520 CONTINUE 
CU TO IOC 
EE CAREIIRBARY SET OF GUFFFICIENTS MAY RE TyPED IN, 
600 CAL] ASSIGN (51, 'kKl1 2%, -1) 
les Ba NN RS 
NVAR 2 1 
KEA^ (5*NyAR) (ACTI Y). TY = 1.2%К41) 
FNDFILE 5 
CONTO AO 
C PARKS_MCCLELLAN COEFFICIENTS ARE PErD IN FROM FILE, 
ESO (АГ АЗИЈЕ 1) 
EEM (S) NEIG 
AN = (NFIL + 1)/2 
IMM (S) ТУ Ман Y= NY) 
NDS 
ГО 660 Әт 1.ММ 
ПСМЕТЕ І-ІІ) 2 A tJ) 
660 CONTINUE 
TD RETURN 


END 
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WINDOW 





Initialization 


Accept filter 
type 







Accept number 





of coefficients 


FIGURE A-12 





p 
ж 


10 


20 


40 


u. 


ТІЛІК? МІМПОм.ҒОР, 741220 


THIS SUBROUTINE at LOLS FLUR POSSIPRUE WINDOWS, 
INES -=PECEELLAN FILTER, OR AÑ ARB. FILTER TO 

s SI [TI CTEO AND A vACUE (F K (PROPORTIONAL 10 THE 
tENGTF OF THE wWIw^Ow) TG FE SELECTED FOR THE MAIN 


PFOGRAM TO USE, 


SU8SROUTJNE yrNDOW (1 TYPE,K) 

ТЕ (7\10) 

FORMAT? tor TYPE IN FUMBER CORRFSP. TO WINDOW! s /ı 1%: 
' II j pi An 2.) BERCHK,=3, RECT,SUV ,% 
"ccn BARS =MECLELLAN Z 6 (11)%,/) 

КЕЛТ (7,20) «ТҮРЕ 

ОО 

iret hel hy oo) 

FORMAT(S eetttit VALUE OF K=? (2K + 1z LENGTH) *) 
READ (7,40) У 

FORMLTLI2) 

RE TURN 


ENO 





¿10 


LNE 


Initialization 


Set UP array 


S with next 
pitch value 


Compute filter 
length 





Zonen 9 





ШІ 


e 


THIS 
lj 


AREA 


TSU“ 


ME 


ME: SEXUEL OR 750315 


SO T opini Jl Di ori FILTER AT THE 
MENG OF ТНЕ РЕСЕКИ ИССИ THE ONSET OF A VOICED 


AFTFR AN UNVOICED SRE Ae 


J ITIALI ATIDN 

See CU АЕ ЛА РА SA 
INMENSA 

O ome 

Re Ines (ІРІП 2) 

CONTINUE 

САНА АЈА ОР TE FILTERS 

ЕО GF Keo Tet IRENFW ISUM, 


RETURN 


EID 





ST 


CALC 







Initialization 


Implementation 


of difference 
equation 


FIGURE A-14 





C 


= j 


ENTE eC Nie БОР 7502515 


ето SUSPOUTINE JS USED WITH THE WAIN PROGRAM NONVOL 
РУТЕ USED TO CALCULATE AN OUTPUT POIN!, 
ШҮЗЛЕРТЕПГС5: 
ARPAY&: 
1° DIE ORT REN Ihe COEFFICLEDTS 
I j ir n CONTADA THe INPUT DATA 
БІ = о ај“ ТО SPECIWGS PETKEEN THE 
CORE TE ENS. 
uni iAFLES: 
1° T = POINTER OP THE CURREMT PROCESSING IN DATHBUF 
ZUR К - \VALUF THAI (CFTERMINES THE NO, OF СОБЕ, 


ЕСТІСІ SUUM TURNEO TO THE MAIN PROGRAM, 


eUT PUT OR FET FTICIER FROM ONE POINT. 


SUBROUTINE C^LC (BL ATBUF: SIr TOUT) 
[IMFNSION A(15) 
INVTFEGES DATBUF (4076 ).5(19)0+.0UT 


ЕЕ Ае THE SUT VADE, 
Е = DATO !PERFOHY FIRST MULT, 
ТМӘГХ = T 


NS UNI =| > (2 RD A £ NPERFORM REST OF MULT. 





ar 


INIFX = 
iu et 


( ONTINUE 


INDEV -~ 


CACI VY 


IOJT = IFIX (Y 


RETIREN 


FIND 


4- 


stly -1) 


DATLUFCINDOEY)) 


- di 


oD) !IRUUMD THE OUTPUT VALUE 





p 
TT: ADAP1.DOC 


DOCUMENTAITUM OR NONVO1 AND МОМУО2 


ЛЕСТВАСТ 


fab oe PROGRAMS JM (| EMENT THE DIDIT AE: SYSTEMS 
RUE A TES IN THIS THES1S. NONVOL CORRESPONDS TO THE 
ADAPTIVE OVERLOApD SYSIF" WHILE  "ONvO2 JS THE ADAPTIVE 


ШСТЕМ, 


SUPROUTINES USED: 


A, NOMVOY 


1. WINDOW (NTYPLıK) = THIS SURROUTINE /LLOWS FOR THE 
USTR TC PRESCRIBE ІНЕ WINDOW FUNCTION, PARKS-MCCLELLAN 
FII TER, OR AN ARBITRARY FILTER DESIRED IM THE PROGRAM AND TO 


Pees ATE ТРЕ VALUE ОҒ K FcP. IHE FILTER LENGTH, 


=. СО (NTYPFs KAL) = THIS SUBROUTINE CALCULATFS JHE 
BERFEILTENTS OF THF FJI TER PRESCRIBED IN THE WINDOW 
EEUODOUITNME AMO STORFS THEM IH AN ARRAY "ar THAT CAN BE OF 


МАУ ТУОМ DIPENSION OF 15. 


D CALL puma ERU STK TOUT) - THIS SUBROUTINE 





I 
NS GNF VALUE OF Ipk OLTPUT PER CALL. TIT РУМА КИТ ЕЕЕ 


5 TIVr FILTER DLEFERFNCE FOUATION, 


4. FILIN (КАМЕМР1.5.151/М) - THIS SUBROUTINE 
IT TTIALIZES THE FTLTER AY THE BEGINNING OF THE PROGRAM AND 
ПИЛЕ СЫЗРТ OF д vOJCFD ARFA FOLLOWING AN UNVOICED AREA, 
ШЕ FI; TER SpACINCS ARE ALL SET EQUAL TO THE NEW PIIECH 


AR TOD VALLE INITIALLY. 


bis Мпм\/ Ор 


1. ATNCOW (NTYPLıK) - (SEE EXPLANATION ABOVE). 


2. СОСЕ (NTYPFeKeAel) = (SEE EXPLANATION ABOVE). 


5. FTLIN (KıNFWPIsS«JSUM) = (SEE EXPLANATION ABOVE) e 


B NN О (3 DATPLEF «Se IS A* IOUT 4 ISTART) - THIS 
SUPROUTINE IS BASICALLY THE SAME AS THF CALC SUBROUTINF WITH 
BEER PITTON THET тт PROVITES FOX THE OvtRLOAN PROBLEM AND 


MOPPECTS Fer IT. 


SEES TEC S S e THIS SUBROUTINE INITIALTZES 


СКЕН yy Fi J Ch JS USED IN THE DETECTION OF ІНЕ 





A 


UvraLOAr PRORLFM, 


С; RFEBREG (TSTART,I«Kk,) = THIS SUPPOUTINE RE-INITIALIZES 


ШЕП rIST^RI ARRAY AFTER A PLEFER SHiFT pAS BFEN PERFORMED. 
OTEER PROSPAMS USED N1TH 1 FSE SYSTEMS 
(SFE THE SHIELD,DUC EXPLANATION; ALL. PROGRAMS ARE THE SAME,) 


A OUTS TO ACAT'TIVE SYSIFM 
1, SI r DAT = EFRON ADDNS 


2. TC IIT ЕКО ЕГ ТОВА 


OUTPUT FROM THE ADAPTIVE SYSTEM 


ШЕ OI TPUT. DAT 


AR ORITHM 


ПИКА Ор па To pASICALLY Tre SAME AS THE SHIELUS+ 


Eu. ONLY THE EYCEPTIC; S ARE LISTEN In THIS DESCRIPTIUN. 


2. THE PITCH, TAB FILE IS INITIAU'LY STORED IN AN ARRAY 
"P" SG EASIER ACCESS TC THE PITCH PERIOD VALUES САМ ВЕ 


PPT NINES 


5, AN PRRAY use TS USED TO HOLD THE SPACING VALUES 


BETWEEN THE COEFFTCIFNTS, THE VARIABLE ISUM IS USED TO 
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EmDOrE THE TENGTH Oor HE FAUER, 


Ф, IM NONVO2 Af ARRAY "ISTA"T" IS USED TO HOLD ІНЕ 
ШЕТТІЛІ STARTI NG VALUES OR A PARTICULAR FILTER BASED ON A 
EN PERIOD VALUE. 4T IS USED JN DETECTING AN  OVFRLUAD 


ШІРГТТІСІ. 


Sr lur OUTPUT POINT FS СОМРШТЕр BY IHE DIFFERENCE 
МИҮТЛТІСІ SEO FUR ІНЕ ppAPTIVE FILTER, IN  NONVO2 ІНЕ 


ШЕСТОМ РРОВСЕМ 15 СОРКЕСЛЕО. | 


Y l o ЕК 1 Sri) ТМ А STRICIAR MANNER -TO IHE 


ӘШЕРІПС" PROGRAM, 


To DURING UNVCICED SECTIONS THE INPUT 15 ATTENUATED BY 


IILI UV ARIABLE, CON, TO OBTAJi' THE OUTPUT, 


D, WHEN THE LOGTCAL | VARIABLE, DONE, IS TRUE, IHE 


ENUSSAP PERFÜMS A SERIFS Of CLOSING STFPS, ANO THEN HALTS., 


ұла 





em 
, 


- 


C 


С 


== 


TITLES  NONy01.FUR 750511 


THIS FROCR/M IMPLFMENIS TRE NONUNTFORMLY SPACED 


ADAPTIVE FILTER WITHOUT CCRRECTING FOR THE OVERLOAD 


он 75. 


SURRCLTINES USED: 


VARI AFL 


l, 


R3 


DT 


WINDOW (NTYPE«K) 
COEF. (NTYPE*sK* A.U) 
FILIN (K,NcWP].S,ISUM) 


CALC —(A.nATRUF, s J K+ IOUT) 


NS CUMPFICIE NTs, 
ПА INPUT VALUES. 

OUTPUT - TEMPOR¿FY OUTPUT ARRAY, 
БЕ НЕЕ ЕО STORE PICA PERTODS, 


Se USES TO > POR SPACINGS BETWEEN CURAS 


NAMEN CIC ADD NOTES ENDT OF SENTENCES 


БІРІ - ТЕ ТЕ UNVOICED ARFAS. 





a 


5. ИЛЕН ТРЕСТ ARRAY "PU, 


AS Er NGTH LE THE FILTER, 


INITTEECEIZATIAN 
WIMFNSION ^(15) 
POE ER S (052 у ,00ТР01 (256) ,Р (1009) ,5(14) 
LOGTCAL DONE, ATIEN 
LATA D^TBRUF,CUTPUT.F,8/*095*04256*x0,3000»*x041"4 x07 
LATA IVAR,JvAR I KVAR/ 2 2 2 / 
TONE = ,FALSE, 
ATIFN = FALSE. 
CREID NSSIGN (24 *RKI YySIGNALeNAT*) 
ЕНЕС ИЕН (3. RKI PITCH. TAR") 
MAMAS ONES RKI OUTPUT. DAT * } 
Peek bate 2 сео гора). IVAR) 
CETTE FILE 3 СО Ота ен Са АГА КО) 
CON = ,3 tATTELNUATION CONSTANT 
CHOOSE 4yNDOW TYpF 
CALL WINDOW (NTYPE sh) 
WCUL/^TF THE COEFFICIENTS 
ШЕБЕР ТУО кее) 
SPECIFY THE IENGTH OF Tht OUTPU! FIIF. 
ПИ АЗИ (K*2): ¿+ 1 
| IP [f a cCULEMQDpO2S6«vUsK VAR ) 
BID THE PITCH ARRAY., 


FO 300 IA = 4,1000 








oe 


REAP (3*UVAR,ENN = 310) P(1A) 
1ге CONTINUE 
D (CONTINUE 

SERAN TRE SPITICH PEROD ARRAY, 
IES PART DETERMIMFS IF YF HAVE A SILENT AREA AT 


Bert CE EGIMMING OF THL FILI, 


~ 


126 JI ЈЕ CORI 200 SILENT AREA? 
ПО А Је YES e SET THF OUTPUT = 0, 
DOS pepe Irre]? 


KA - KO 


mn 


130 ОВЕ СЕБЕБ CO IG 140 
KA S KA e 256 
c6 TO 120 
240 CUTPUT (KA) = 2048 
ИАЕА Дау = 0 
J ле 25660 ТО 150 115 IT TIME то WRITE A REC.? 
BRITEC4H* KY AR) (COUTPU TCKB)«Bz1«256) Sis 
Tap CONTINUE 
SUP THE COUNTER, 
el] 
(0 TO 120 
EE DUCI PETPIOD WAS "OT A SJLENT ONE, 
ern ШГРФ ЕРІ) GO TCO S000 
C VOICEC. Slee whe FF TeV SPACING. 
МӘЗІСІ Son) TO KAA 


CAL! FILIN (heP Cd) eS, TSUM) 
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IE UNSUKEMEMTIS SECTION 


ІК = ISUN 7256 
ТРО (РАР Оз УКЕ ЛО Т) ІК = IR + 1 
ПЕТ UP FOR INITIA nF ACIC INTO BUFFER, 
IVAP = KVAR = IR 
eS OS Ke = 1,16 
REAR (COT IVAR) (CUATBURECKE + ((Kt-1)+256))+KF = 1.256) 
sun CONTINUE 
U SFT UF THF 3UFFER POINTER 
ЈЕ (KA+EQ0,255) КА = -1 RESET THE POINTER 
ITO THE LAST OUTPUT, 
PS TSE COBRA (KE + 2) 
ПИСТЕ DUTPIT CALCULATIO € 
SO СА! СМС (Д,ПАТВОҒ,65.1% Қ. 1007) 


TUTE THE ADDRESS FOR TPE OUTPUT POINT, 


SAC INO > l 

SCF ЈЕ (110, (Е•255) 60 10 370 !00 wF HAVE A GOOD ADDRESS 
THO = IND ~ 256 IND Go RECHECK, 
CO TA 365 

57п АЛЕК “СО 70-575 Powe eA eV OLCED SEGMENT 7 


КИД АРТ пон ни ОСЕ YES, GET CALCULATED VALUE. 
СО ТО ЗРО 

SE TFE LATE THE INPUT. 

ors CUIPHTCIND) =( CON#+( Cp TBUF (1) -2?048))+2048 

580 SPINE) = D SOON mE ANNE = 05 


Een ІС UTE IHE COLIPUT BUFFER? 
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Hee С (арена. 56 10 400 
MRITE (CU KVAR,END=30p0) (COUIPUTCKP).KBRz1,256) IYFS. 
YON ee et 
ES FRE A PITCH FARE? 
ЕАО ОЕ ОС ОТЕ у бо го 1220 
510 (Сирт = РЈ) « 2 EET A NEL) VALUE OF PITCH, 
u = J + 1 !guMP TRE FITCH PERIOD POINTER. 
ЭШ ОКЕ VE ENTERING a SILENT AREA? 
J ЕРІГІП CO то 1600 
СТО. ЛАКЕ WE ENTERING AN UpVvVOTCED AREA? 
СЕЕ ЕТ Соот Gc TO. 3800 
MOS TEEREFORE WE ARE ERTEPING A VOICED AREA. 
GANGE THE FILTER., 
па том еп TO KXX 
NES THE FILTER BEING INITIALIZED AFTER AN UNVOICED AREA? 
ШАРЛ АЕ РЈУ GO TO 60 | 
CO TO 900 ТРЕ FILTER IS NUT BEING RE-INITIALIZED. 
acp PIVEN = „FALSE, ПАЗАРА n IE РК ЛС» 
ПО ОРАО yil 5.15077 
CO TO 1220 
аг КДА к= 2: жү. 
1000 Ја GO 10 1100 
S(XAA) = SIKAA ~ 1) ¿SHIFT THE SPACINGS 
ШАЛА = KAA -1 
CO nO S cnn 


1100 O NET torte ir NEW PITCH PERIOD SPACING 





rn 
Dome cocenDmpPOUTE TRE LENGIA OF THE FILTER 


Io UU Eg volt | 
ГО = TSUN ao (RI) 
1200 Lu: 
о (С = Т + 1 
Ms 11 TIFF 10 SHIFT THE - URFFER? 
ІР (1.61.4096) СО ІС 1400 
EEES: SHIFT THE BUFFCP. 
NOE TUM 256 
NN NCIS) IR = IR + 1 
IVAR = IVAR IR 
Палетка = 3416 
NNI DL ОЕ 52) CÚ TO 2909 
СУОЛЕ CDAVDUPUKS + (UK2-T)*256))4.K^5 = 1,256) 
JSON CONTINUE 
Meee Sk] I 
2510 1 с (ІН ж 266) % 1 
1400 CO ТО КХХ, (550%560,1840) 
НИ > PREA RESERVEPR FOR SILENT AREN LATER. 
leno МЕРІ = IApgS(NEVPT) GET RID OF MINUS SIGN 
CONF = TRU, ISETADONE FLAG. 
(0 TO 1800 
EEECISVOTCES- AREA 
Dno I ci | E PEHE POTTER: 
J APAE TSUN (O 19 2000 


1820 | ME d ПАНТА РОС АКАТА Бете 
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ERPS КАВ = 0 
ОГГЕ WE THROIGH ӨҢТЕГІМС? 
1820 ШЕ ГКАЛ сео сеже Ро р) уоог ТО 1855 
KNEW = 2 + K = KAR 
KOLN = KNFW - 1 
S(KXHEV) z S(KOLD) МО CONTINUE 
KAB = MAP 4 1 
CO TO 18390 
qan HORT 
Bs 7 ASSIGN 1840 TO KXX ` 
(СОТО СОУСОМ ЕТЕ АМ OUTPUT, 
1640 KE = KB 2 
SCI CU TC 1837 PARET WE THRU? 
E OK SEU YES, BUMP K7 
I ITIER THE ATTENUATION PRC CEDURE 
l 50 Mei ТЕШЕ. ISET TRE ATTENUATION FLAG. 
JF (ГОМЕ) 60 ТО 1506 PARE WE ALMOST THROUGH? 
ASSIGN 360 Те КХХ Inc, THIS TS ONLY AN UNVOICED AREA. 
СО TO 5еє0 
3 00 ОЕ. 
С ГО WE HAVE A SYATARLF INL? 
1220 J IND, ER. 255), 60 TE 1950 
JNO = ING = 256 INO, TRY AGAIN. 
CCU то ICD 
ІІ CUTTINGS = 204€ SILENT AREA THEREF ORE 


[SET OUTPUT = 2048 (OR 0). 





1270 
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CUTPUT (INP 4 1) = ( 
Io J jO O IC O TIP UTO DUMP OUTPUT BUF? 
WRITFCR' KyAR,.END = 3000) (VUTPUT(KB).KU=11256) 
Ine IRE SEI POINTER 
ИО = ND SES 


cO TO 192n ШОГАШ TND OF FILE, 


C THIS AREA WIPFL НЕ DEALING WITH UNVOICED SEGMENTS WITH 


Ü TSUMSNEAPT. 


Пт) 


E 


ПБ 


2210 


2000 


GO TO 5000 

LOL ODDS emnes de 
(10) ewe ey Seite 254 
LATRUFCGLAX + (CILAX=-1)*255)) = 2048 
CONTINUE 

CONT TNIE 

CU TOSIN 

Sr eee 

IO PLE $ 

FNOFTLE y 

STOP 


END 
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NONVO2 


Initialization 


CALL WINDOW 


CALE COE 


Fill pitch 
array 













Silent no 
area ? 
Fill output 200 T 
file with zeros no Pitch 400>\_yes 







Assign 350 to 
KXX 








SAN 


300 
Fill buffer 
initially 


CALE BEG 


FIGURE A-15 





S 


NONVO2 


390 
CACCKCAELCZ 


Resuit— IOUT 


360 
Prepare Tor 





output address 


370 
no Unvoiced es 
Sediment! 


CXII UIS OUT ОТР GODS 
| NPUT 








Output for 2na 
channel = 2048 





Write output End 
onto disk 


E 
400 






no Change yes 
Date 
no Silent yes 
area? 
ОО TO 1220 = 
Set DONE 
no yes FLAG 
| 1800 | 
Assign Move filter 
350 to Out of voiced 
KEX X area 


CIEN ON T) 
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NONVO2 
I 1850 © 
15 
Set ATTEN 
no c^ ATTEN set ?, Ves 
900 
Shift spacings 
g Clear AT TEN Je 
Assign 360 to 
KXX 
Slide buffer o 
OO TO 
1920 






Fill output file 
with zeros 


CALL FTLIN 





CAEL SEG 


1220 
Bump | 


— 






Shift buffer 


Reset | 





GO TO KXX 
(350 ,360,1840) 






PRETA (C OIN T.) 











С 


ТОРЕ" NONVO2.F UH 750521 


THIS pROCRAM IMPLFNLNIS IFE NONUNIFORMLY SPACEO 


ШПАРТТУ:- FILTER pith THE CORRECIION FOR THE OVERLOAD 


PROBL E Me 


ENRROLTIMES USEN: 


1. 


WINDOW (NTYPE+K) 

COEF (NTYPEeKeAgl ) 

FILIN (kK, NEWPI+«S,ISUM) 

PEG (PSTART«S« I.E) 

Aia А ВА И С ак Ото ТУ ТАВТ) 


ВЕВЕС СТЕТАКТ ЕГ, К) 


ME CONTAINS COEFFICIENTS 

НИНА МАО РАНИ ЛАА АЕ COPE. DURING OVERLOAD, 
DAVRUF - CONT/INS INPUI, 

ООР О ТЕГЕГЕТЕТ ОСТҮРИТ APRAY, 

ПО АЛАТИ А ОНЕ РЕ КО О « 


Ос 57 I] 0 S wor ne (NOUR FILTER, 


VARIATERS: 





C ОЕБС ИИ ПОРЕ о END OF SFNIENCL. 

C Ze ATICN - LOGICAL MENOTES UNVOICED AREAS. 

C ОРО ИС ИЗЕР ИТЕН ТОВ Тор ARRAY "р", 
C НА о ПО. ИАЕА БИТИ АНКА ПН Еро ТЕК, 

C 

C ПАПИ TA ET ZANE ION 


AS ON CA па Ое) 
JNIFGER DAFTBUF (4096) ¿O0UTPU! (256) +P(1000),45(14) 
O дет (15) 
КОТ ГАБ Г OMN ATEN 
LATA DATBUF,PUIPUT,F ,S/4095x*0 ,256+0,1000x0,14x0/ 
CATA ISTART, B/19*0,15*0.07 
САТА IVAR,JVAR«KVAR/2«2«2/ 
РАТА DONF. ATTENZ.F 5L SE. «FALSE, / 
ASSIGN (2: RKI: SIGNAL: DAT?) 
CARTAS STON (3, !RKISPITCH,. FAR?) 
ЕО ЕМ (ЗИС ОРОТ DAT?) 
[ET TE FILE ° (251+2z256 U IVAR) 
LEFINF FILE z (1000, 2 U+  JVAR) 
UT Ns 
C CHOOSE ATNDOW TYPrF 
CALL WINDOW (NIYPL қ) 
NEC ALCUL ATE THE CUEFFICIENTS 
CAA COT ANTYE Vinel) 
C SPECIFY THE LENGTH OF ТІН, OUTPUI FILF, 


ИЕ ТРЕБЕ (К?) + 1 








s z 
L ber Lilie (LIPS Tee aoe Ue KV AR) 


C FILL THe PIICH ARRAY, 
[uS О IA 234-1800 


KEAN (C5* JUAR,.END = 110) РСЈА) 


Inn CONTINUE 
110 CONTINUE 
_ = 1 ty IS THE POINTER IN THE PITCH PEROD ARRAY, 


Peels BART DETERMINES IF LF HAVE A SILENT AREA AT 
ПИЛЕ BEGINMINSG OF THE Flle. 
120 MANAGE POLEO TC 200 SILENT AREA? 
“ = JABS(P(J) * 2) EYES, SH THE OUTPUT = 9. 
ПО О руте паге Је 
E TD 
14n IF (KA.LE.E55) GO Ic 140 
KA = KA = 256 
СОЛТО ЕТП 
1*0 A = ZAT 
CIPOT ДА оту z= Ü 
Ane zS) GO Tc 150 IIS IT TIME TO WRITE А REL,? 
WRITFUS*KyA^R) (COUTFLTUKB)4«^Bz1*4256) !YES, 
Bon CONTINUE 


EE UFP Th. COUNTER, 


~ JD 
(COSMOS IE 
БРТТСЫ PERTOD Was NOT A SILENT ONE, 


200 J I I СЕ•цао) 60 1С 3000 
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NE GU ЦЕР tur FILTER® SPACING. 


ASSIGN 350 TO КХХ 
(m ош СУ ТУЫМ) 
MINE A SURESIEMTS SECTION 
ІҢ - ІЗІМ /2<6 
DCN СЕТТІ ІН с ІК 41 
C SET Ur FOP JNITIAL REAPINC INTO BUFFER. 
IVAR = KVA—R ІК 
LOGO SOO KE ZO TE 
КЕКАП(Р"ІУЛЕ) (ЮЛТЕОЬТ(КЕ ж ((КЕҒ-1)ж256)), КЕ = 1:256) 
S CONTINUE 
C SET UF THF 3UFFER POINTER 
JF (KA.EG,255) KA = -1 BRESEIZIHE SPOTNTER 
IOF THE LAST OUTPUT, 
1: (ІЗ ж 254) + Ch, + 2) 
C ThITI¢LIZE IE ISTART ARRAY 
CALL ВЕС (15ТАІКТ%63417.К) 
ӘНІ TEE OUTPUT CALFULATIOES 
Son Cree eee oN shies Se lake TOUT TST ART ) 


(I IPUTE THE ADDRESS FOR 1HE OUTPUT POINI, 


ser prole cl 

OF, MAI DO C79 pO RE HAVE A GOOD ADDRESS 
IND = IND - 256 IND GO КЕСНЕСК» 
(0 TO 365 

Sn EEG DEOS SS UDLSSOTHLSCA UOICED SEGMENT? 


(РА РАТ ОИ -> IOUT TESST CTT LCAICULATED VALUES 
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СО ТС 560 
Pee telah THE INPUT, 
475 CUIPUTtIND) = (COP + (DATBUF( I) -~204%8)) + 2048 
586 COPs) 0 Pee motte = 0, 
TS IT ТІРЕ ТО DUMP THE OUTPUT BUFFER? 
TT CINE GO TO FOO 
WRITFOS*KVAK,ENDZAZCGO) COUSPUT (KB) eKB=1,256) !YES. 
Фер JT = I + 1 
RTS THERE A PITCH MAKK? 
Us (DATBUF(1) ¿LT+4075) GO ВОЗ саб 
500 ПЕН (jeer NS ОЕТ ОА NEW VALUE OF PITCH, 
EEUU Uu SUESPIFCHETSERTOR POINTER, 
C ARE WE ENTERING A SILENT AREA? 
J I l rl 1. 0500 TC 1600 
EEUU ARE WE ENTERING AN UNVOICED AFEA? 
ЕЕРЕЕ ЕОС ОВОО 
OS TEEXEFORE WE sRE EPTERPING A VOICED AREA. 
EE ANCE IPE FILTER, 


ASSTGN 350 TO KXX 


57) 
= 

EA 
— 


cee neler SENG INIVIP LIZED AFTER AN UNVOICED AREA? 
ТР (ATI rN) Gn TO A606 

(OP ERES NOT BEING RE-INTTIALIZED. 
BON Hie w=. EASE | КЕПЕ АК THE FLAG, 

EAE TO SLIDE THE BUFFER? 

ТЕСТЕ СЧЕТУ жоюуу GO TU R20 


cuo ЕІ БУ ТЕ ГГЕ POINTER, 











ein 
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IHAF = NEWPI * 2 3 k 


LES CHINE 1) 725€ 

AS ИЕ ИВЕТ) IRA = IRA + 1 

IVAR = JVAR - 16 - IFA 

O S 

HERO TVAR (ПЛТЕГГСТОХ + (CIDY-1)«x256) ) 4 TOX21:256) 


( ONTINUF 


RESET THE POINTER 


коп 


an 


1000 


1100 


Ep 


1220 


1 5 НОО + (256 * JRA) 

AFANES ANE WE 172) 4 5: TSUM) 

MADE ESTAR To. 114) 

GG TO 1220 

[TOM x K 

ТЕСКАЛЕС +) GO 10 1100 

Scie = SIKAA = I} SHIFT THE SPACINGS 
KAN = КАД 1 

CORTO ооо 

<(1) с МЕМРІ !SE!I ЊЕГ NEW PITCH PERIOD SPACING 
І5ЈМ = 0 !СОМРЏТЕ ТЊЕ LENGIH OF THE FILTER 
BO А ОМА ИК = eee 

үл TS + о 5{К1) 

CONTTNUE 

CALL BEG CLSTART+Ss] 416K) 


Пи. 


MEUS ET TIN TO SHIFT THE LUFFER? 


ТЕК Г ЕТО) СО Tc 1400 





u P LU — Е Е mE 
ü I I 
y 








- 250 
NES, SHEET !HE PUFFEP. 


ЈК = 150М / 256 
ПАС ОТА Nt. Lo) IR = IP + 1 
IVAP = IVAR = ER 
(0 1200 K2 = q 16 
TEE VARIGE 252) (0 T0 2900 
kKEAD (2*JyARn) (CATbLE (KS *€ ((K2-1)*256))4K35z14256) 
TENN ( ONTINUF 
ШЕЕГСЕТ I 
2510 J = (IR ж 256) 4 1 
С РЕЅЕТ ТАГ ISTAR! ARRAY. 
CALL REREG (TSTARTe) К) 
1400 GO TO KxX, (356:5€0,1840) 
FII IS AREA RESFRvEP FOR SILENT AREA LATER, 
1600 МЕЯРІ = Tabs pt Wel) IGET RID OF MINUS SIGN 
PONF = TRUE. ISET DONE FL AG, 
CO TG E00 
Meee VOICED AREA 
hal !LUMP THE POINTER, 
I n SUI еп ог ес о 
1220 К7 = 1 lo PD A COUNTER, 
uot Dn 
E DE Lt THROUEH SHTETINC? 
ie 7 0 IF (KAB.EQ.((2xK)-h7)) GO !O 1825 
КМЕМ = 2 x K - KAP 


КӨШІ = KNEW <= i 
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S(AHEW) = S((CKOLD) INO CONTINUE 
PAORS КАР 4 4 
CO TO 1839 
ust, Ju ceo 
[0 18365 ITE = а се Ка 
1SJm = 1Sym + S(ITE) 
ПАСА Є. CONT DRE 
СОО ВЕСУ ОСТИ SIT YI) 
Коте и 
627 боз Run TO KXX 
GO TO SRO IGO COMNFUTF AN OUTPUT, 
16840 Ка = KB + 2 
IT КО Т (лу) GO WNO 1837 IARE WE THRU? 
K7 = KI + 1 TIES BUME КУ 
IF (K7 ~ (2¢K)) 1826,1635,1850 PARE WE THRU? 
NENENTIER THE ATITENUATIOM PRCCEOURE 
LREN Pulpit boa" eT RiP ISET THE ATTENUATION FLAG. 
ТЕЛ (ПОЛ GO TO 1706 IARE WE ALMOST THROUGH? 
ASSTCN 36n то КХХ ВС. THIS IS ONLY AN UNVOICED ARCA. 
CO TO 360 
1900 ND = l 
END C wr НАЛЕ A SUITABLE INL? 
520 ПЕШТЕ р Бе) CC OSO 
IND = О - 256 INU, TRY AGATN. 
cO TO 1°20 


ЕЕЕ (С CUCI NDI) = Zoue SID D ШАКЕЛ, THEREFORE 








270 


B DU PUT =5 2048 COR 0). 
CUIPUT (IND + 1) = & 


IJF ТМО КЕ. 285) 60 3C 1970 !ТІМР TU DUMP OUTPUT BUF? 


L Tr АН nr Eh = 000) '(DUTPUT(—(KD) Küu=) 256) 
052-521 Ia cup T EIS 
INI SUL TD 


CO TO J62n КОО ГУ UNTIL END OF FILE. 


ELS AKE VICU RE DEALING WITH UNVOICED SEGMENTS WITH 


C JSUMSNEAPT. 


посо 


cnp 


2300 


2905 


5270 


S000 


vum C comm 
FORMAT (*'* *,*pROGhHAR EXITEU FROM 2000*) 
CC TO 5000 


ро сез” ТЕЛУ» К2,16 


L02905 JLAX = T296 

LAVPRUFLUJLAX. & (CILAX-10*255)) = 2048 
CONTINUE 

CONTINUE 

CHTO 131 

ENQOFTLE 2 

FNOFTLE 3 

Lt NOFTLE ü 

STOP 
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Initialization 


Set'PSTART 
array TO Zero 


Set PSTART 
array tor the 
initial positions 
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OEE. CEC TOR 790323 


FROGPAM IS Aa SURKOUTINL FOR THt NONYOD?, FOR PROGRAM, 


TILALIZES tHe ISIAKY ARRAY WHEN CALLED. 


INITIALIZATION 

SUBROUTINE BFG OTST TSS ISK) 
TOE ERER АРТ с 

ШО ЕСТ 

II A 
РТО 0 

СОМТТМЈЕ 

DEI ZEILE ARPAY WITI TAHE SPACINGS, 
Ion 1 

| Ет 

РАЈ те око ODT 

КМО = KNO ~ S(0 - 1) 

FSTART (J) = KNP 

CONTTNUE 

RETOURN 


t NO 





dg 


Ris ele 


Initialization 


Shift values 
in | START array 





FIGURE A-17 
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REPEC. EOR 750326 


ПЕШЕР ИЕТ ЕБИНЕ ISTART ARRAY AFTER 


EE I DoD ІНЕ ПЛЕРЕГК ГТГЕСФФГЕНР PERFORMED, 


МО А А ГТО 


SUSROUTINE REPEG LU1START.T«K) 


INTEGER 
EEREOKM THE | 
ГО 20 У 
JSTAP T 
CONTINUE 
IE DIET 


E ND 


JSIAPT(15) 
XCHANGE OF ТРЕ ELEMENTS IN ISTART, 
zen T1 


(ly EEE А Бутан ISTUD JE) 





pado 


CALC2 


Initialization 







overloading 
occurred? 


Tenor! 
coefficients 









Compute rescale 
factor 










Rescale 






coefficients 


Implementation 


of difference 
equation 
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Tl ii c Cnr o FOR 790524 


THIS SUBROUTINE IS USED LITH THE MAIN PROGRAM NONVO2 
APO те O O CUTE AN OUTPUT POINT, XT ALSO 
PFTERMINFS Jf THE COFFFILIENTS NEFD TO BE "TURNED OFF", 
PARAMETERS? 
ARRAYS: 

1 • IS CONTTINS ThE COEFFICTENTS 

2.  DATRUF - CONTAIAS IHE INPUT DATA 
e S = CONTAINS The SPACINGS RETWEEN THE 

COEFFILTENTS, 
4e TSIART - CUMTAILS THE INITIAL LOCATIONS 
OF THE COEF. FCR THE PRESENT FILTER. 

DARIAPLES: 

1. J = POINTER OF TE CURKENT PROCESSING IN DATBUF 

2. К.Т: "VALUE ГПАТ ТГТЕПНПЫГИЕЕ THE МО. OF COFF. 

Se TONE VALUE RETURNED: TO THE MAIN PROGRAM, 


ШЕСТОРО УРА ИЕ РЕНЕ ОМ ONE POINT, 


SUBPOUTINE CALC? ('AUATBUF* S 2 «K* IQUT 1 START) 
ШОО БИО Ил ан Ве) 
ONT CES EERIE (HO ) S (15) Ss ISTART (15) 
Пол НЕ 

* SCALE = «FALSE. 


A Tote toy peo ty VE BEEN REACHED, 
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ees] 


ИІ 2 1) 

[С 10 11Р = 2,(2*#К) + 1 

UND ee, 
Ө ЛАА И А АСОВА О ТЕ ТНАТ HAS PFSSED ITS 
AREAS? 

IE (INO GE TSTAR TI - 113 GU TO £ 

E СТР) 


EO TOPI 


e E(IP) 0,0 MESS ПАОЛА ПК THE SCORE, 


Ce = a TRUF, 


t 


It CONTINUE 
ПАША КОЛА НЕ О ТО ТАЛОО WESMEED ТО RESCALE? 


SCALE = FALSE,» БИН КЕССПЕЕ, 


ЖИЕГГЛРОТЕ THE БКЕССДІР ҒАСТСР 
ПОСЛИ th =] tert. ed 
SONT =5 SOM «+ Бет 

ees CONTINUE 

Meee oCALE THE COEFETECTENTS 
ПА У ложе 1 
РАСА ЈА ЕРЕ БСА 

17, CONTINUE 

EN UPDTAUTON-OF Tue OUTPUT POINT 


18 Y 0.0 


и 


OA EA) 





2N 


INDFX = 7 

OSO IY 2 So oak) a] 

INOFX - INPEY - SY -1) 

Jc SUED IC e АРМ) 
CONTINUE 

lO TIP f i P FES) 

RE TURN 


END 
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